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PREFACE 



Historically, array transform processors have been 
largely integer-arithmetic devices, since the 
slower processing rate of floating-point arithmetic 
was undesirable when working with large arrays of 
data. However, integer methods have problems which 
make programming awkward due to the limited dynamic 
range of integer arithmetic. Array scaling and 
block floating-point techniques either allowed 
human and other errors to creep into the results or 
were costly and time consuming. Further, as 
processing became more sophisticated, even 16-bit 
integer data words were insufficiently precise for 
preserving the accuracy of simple 8-bit 
analog-to-digital converted input data. This is 
because the many multiplications and additions in 
typical cascaded array processing can cause the 
propagation of truncation errors. 



NOTE 

A 16-bit integer multiplied by 
a 16-bit integer results in a 
32-bit product. If the result 
is truncated to the 16 most 
significant bits, then half the 
time the resultant's least 
significant bit (LSB) is wrong 
since it should have been 
rounded up. Now the product of 
two of these potentially wrong 
LSB numbers results in the next 
LSB being wrong part of the 
time; thus cascaded operations 
propagate the errors leftward 
toward the most significant 
bits. 



FPS 860-7259-003 P - 



With the advent of faster digital logic, many users 
realized that floating-point processing makes 
programming easier, virtually eliminates dynamic 
range problems, greatly alleviates the precision 
problem, and is potentially as fast as the last 
generation of integer processors. Floating Point 
Systems, Inc., recognized this trend in 1970 and 
was formed to specialize in floating-point 
processors. 



The rush to floating-point processing was not a 
smooth one. Many floating-point formats sprang up 
and Floating Point Systems became expert in format 
converting on-the-fly so processing time would not 
be lost during a format conversion. Why convert 
formats? Simple. Not all formats are 
mathematically clean. For example, it is unwise to 
use a hexadecimal-exponent format for serious 
number crunching because a hexadecimal 
normalization can cause as many as three leading 
zeros between the binary point of the mantissa and 
the first significant bit. This means that as many 
as three least-significant bits may be lost, due to 
right-shifting the mantissa past the available word 
length (truncation) when an extreme hexadecimal 
normalization occurs (about 25 percent of the 
time), and, of course, 2, 1, or no bits may be lost 
(with equal probability) for other possible 
hexadecimal cases. Cascaded calculations can 
quickly cause the low-resolution three-leading-zero 
data words to contaminate a data base. 



The FPS solution is to use a true 10-bit binary 
exponent, which has more dynamic range than the 
standard 7-bit hexadecimal or 8-bit binary 
exponent. FPS then uses a 28-bit mantissa, plus 
three guard bits in the adder and a double mantissa 
at the multiplier output, which provides enough 
bits to not only allow for hexadecimal in/out 
formats, but also to carry enough information to 
permit post-normalization and convergent-rounding 
after each arithmetic operation. Thus, FPS can 
receive any reasonable floating-point format that 
is desired as the input format, convert it 
on-the-fly to the FPS format, process it in FPS 
format with minimal truncation error propagation, 
and then convert it on-the-fly to the desired 
output format. This procedure allows transparent 
no penalty operation on the data, thus preserving 
the integrity of the input data. 
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In addition to the well chosen floating-point 
format, the AP has a general-purpose, multi-bus 
oriented architecture for the arithmetic units. 
This allows great flexibility in that operands and 
resultants can be moved simultaneously from almost 
any register in the AP to any other. This rather 
generalized structure of the AP allows it to 
execute specialized algorithms, such as the FFT, in 
times comparable to those achieved by hardwired 
special-purpose processors and also makes the AP 
well suited to less highly organized computations. 



In the matter of software, note that this machine 
is a synchronous monolithic multiprocessor, as 
opposed to an asynchronous multiprocessor. The 
practical significance of this is that programming 
by the user and/or FPS (Standard Algorithms, System 
and Test Software) is tremendously simplified due 
to the predictability of data flow and timing 
considerations. There is no need for internal 
hand-shaking between arithmetic units , memories , 
and microprocessor; data and results are available 
at precisely determined times. The synchronous 
approach not only allows a non-stochastic simulator 
to be written for easy program debugging, but in 
addition, programs may be single-stepped in the 
real processor, with execution identical to 
free-running programs. A further bonus of the 
synchronous design is the easy producibility, 
maintainability, interchangeability and reliability 
(there is no need to explore an infinite number of 
possible timing conditions as one clock phases by 
another, as happens in an asynchronous machine). 
Convenient and rapid data-dependent branching, 
simple overlapping of data input, arithmetic 
processing, and data output are further examples of 
the care taken to assure a fast, accurate, 
convenient, and reliable array processor. 
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CHAPTER 1 



GENERAL INFORMATION 



1.1 INTRODUCTION 

The AP is a high-speed (167ns cycle time) peripheral floating-point 
arithmetic array processor (AP), which is intended to work in parallel 
with a host computer. 

The AP's internal organization is particularly well suited to 
performing the large numbers of reiterative multiplications and 
additions required in digital signal processing, matrix arithmetic, 
statistical analysis, and numerical simulation. 

The highly-parallel structure of the AP allows the overhead of array 
indexing, loop counting, and data fetching from memory to be performed 
simultaneously with arithmetic operations on the data. This allows 
much faster execution than on a typical general-purpose computer where 
each of the above operations must occur sequentially. 

The AP achieves its high speed through the use of fast commercial 
integrated circuit elements and an architecture that permits each 
logical unit of the machine to operate independently and at maximum 
speed. 
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Specifically: 



• Programs, constants, and data each reside in separate, 
independent memories to eliminate memory accessing 
conflicts. 

• Independent floating-point multiply and adder units allow 
both arithmetic operations to be initiated every 167ns. 

• Two large (32 locations each) blocks of floating-point 
accumulators are available for temporary storage 

of intermediate results from the multiplier, adder, 
or memory. 

• Address indexing and counting functions are performed by an 
independent integer arithmetic unit that includes 
16-integer accumulators. 



In a typical application, such as a fast fourier transform (FFT), the 
above features allow nearly the entire computation to be overlapped 
with data memory access time. 

Effective processing precision is enhanced by 38-bit internal data 
words, an internal floating-point format with optimum numerical 
properties, and a convergent rounding algorithm. 
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1.2 SYSTEM OVERVIEW 

A general block diagram of AP arithmetic paths appears in Figure 1-1. 

Connection is made to the host in a manner that permits data transfers 
to occur under control of either the host computer or the AP. For most 
host computers, this means that the AP is interfaced to both the 
programmed I/O and DMA channels. 

The system elements are interconnected with multiple parallel paths so 
that transfers can occur in parallel. All internal floating-point data 
paths are 38 bits wide (10-bit biased binary exponent and 28-bit 2's 
complement mantissa) . 

Main data memory (MD) is organized in 8K-word modules of 38-bit words 
expandable up to 64K words in the main chassis. The effective memory 
cycle time (interleaved) is 333ns. 

Table memory (TM) is used for storage of constants (FFT constants) and 
is tied to a separate data path so as not to interfere with data 
memory. It Is bi-polar 167ns read-only memory and is organized in 
512-word, 38-bit increments. 

Data pad X (DPX) and data pad Y (DPY) are two blocks of 32 floating 
accumulators. Each is a two-part register block, wherein one register 
may be read and another written from each block in one instruction 
cycle. 

The floating adder (FA) consists of two input registers (Al and A2) and 
a two-stage pipeline which performs the operations and convergently 
rounds the normalized result. 

The floating multiplier (FM) consists of input registers (Ml and M2) 
and a three-stage pipeline which performs the multiply operation. 
Products are normalized and convergently rounded 38-bit numbers. 

The s-pad consists of 16 integer registers and an integer arithmetic 
unit which is used to form operand addresses and to perform integer 
arithmetic. 

Chapter 2 contains a more detailed description of each of the 
functional elements. Chapter 3 describes programming considerations. 

Chapter 4 describes in detail the host computer interface, which 
Floating Point Systems, Inc., supplies. A number of off-the-shelf 
interfaces are available. 
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Figure 1-1 General AP Block Diagram 
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1.3 EXAMPLE AP APPLICATION 

A simple FFT processing sequence goes as follows: 

Initial conditions are that the FFT program is resident in program 
source memory internal to the AP, the array to be transformed is 
resident in host memory, and the host CPU has initiated the AP 
processor with an I/O instruction. 



1. The AP requests host DMA cycles to transfer the array 
from host memory to internal data memory. Data is 
converted from host floating-point format to internal 
AP floating-point format on-the-fly. 

2. The FFT algorithm is performed with data remaining in 
internal AP format. This yields the benefit of 38-bit 
precision and convergent rounding during the critical 
phases of processing. 

3. The frequency domain array is transferred back to host 
memory by requesting host DMA cycles. Data is converted 
from internal format to host format on-the-fly. 

4. The AP proceeds to another process or stops executing, 
depending on previously established conditions. 

An interrupt to the host can be issued. 



The AP is most efficiently used when a sequence of operations is 
performed on one or more sets of data which reside in internal data 
memory. This reduces data transfer overhead and retains maximum 
numerical precision. For example, a reasonable sequence would be to 
transfer a trace and a filter, FFT both, array multiply, inverse FFT, 
and transfer the result back to host memory. 

The AP data memory has DMA capability. That is to say, MD cycles can 
be stolen from the AP microprocessor by the interface. This capability 
allows host computer DMA to AP DMA data transfers to occur, thereby 
minimizing both host CPU and AP overhead. 

The AP is designed with enough flexibility built in so that its power 
can be harnessed in a variety of ways. Subsequent sections describe 
its use in detail. 
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1.4 PHYSICAL DESCRIPTION 

The following sections describe the AP hardware. 



1.4.1 GENERAL 

The AP is available in rack configuration. Mounting is as a standard 
19- inch EIA rack-mounted unit requiring 24-1/2 inches of vertical 
space. The unit is equipped with rack slides permitting easy access to 
the etched and/or wire-wrapped circuitry with the chassis mounted on 
the forward portion of the unit. The power panel is mounted at the 
rear. One and three-quarter inches of space should be available above 
and below the 24-1/2 inches of the processor. This is for proper 
intake and exhaust of air through the processor. The control panel 
(refer to section 1.4-4) and/or blank panels may be used for proper 
spacing if the customer's equipment mounted above and below the 
processor does not have the proper free-air space built into it. 
Intake air should be between 10 and 40 degress centigrade. 



1.4.2 FORWARD UNIT 

The forward unit contains all AP circuitry except the power supply. 
There is provision for up to 31 15-by-10-inch etched-circuit boards 
(ECB). The ECBs plug into a mother board. The ECBs are arranged in £ 
vertical plane (chimney style) with push/pull fans to assure adequate 
upwards air circulation even in the event of a fan failure. The I/O 
cable exits at the bottom rear (the exact configuration is computer 
dependent). This unit is called the processor. 



1.4.3 REAR UNIT 

The power supply consists of three assemblies. The first is the main 
+5 volt supply and is capable of 100 amperes output. The other smaller 
supplies are -5 and +12 volts. The power supplies have forced 
convection cooling. All supplies are rear-mounted, along with the line 
box (containing line filters and contactor), on the power panel. 
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1.4.4 POWER, CONTROLS, AND INDICATORS 

The AP is expected to be normally powered up and down with the rest of 
the system. The AP switch and indicators are on a control panel. 
There is a single power cord (US standard three-wire with ground) which 
must be connected to 105 to 125 volts, 50 to 60 hertz. The service 
should be rated for 20 amperes (10 amperes in the case of the higher 
ranges) in order to provide a low-impedance source (power required is 
approximately 1200 volt-amps). The control panel may be mounted above 
or below either the processor or the power panel. Availability of line 
power is indicated by a neon LINE VOLTAGE indicator. If the ON /OFF 
switch is on, then power supplies should come on. There are two 
operation indicators: one shows array processor action and the other 
shows DMA transfers. The three individual power supplies have separate 
indicators (electroluminescent diodes). There are no external 
adjustments. The internal adjustments are the three power supply 
setting potentiometers on the power panel. 



1.4.5 SERIAL NUMBERS: 

The processor has a serial number tag on its starboard side near the 
top and forward ending in A. The power panel tag, ending in B, is 
located inside and near the top. The control panel has its tag ending 
in C, also inside. 
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6' POWER CORD 



COMPUTER INTERFACE 
REQUIREMENTS CABLE 

1) ENVIRONMENT: - 40 J C ? 2 - 90% RELATIVE HUMIDITY. 

(DERATE 1°C =>ER 2500 FT. (762 M) ABOVE SEA LEVEL, 5°C FOR 50 HERTZ OPERATION.) 

2) POWER CONSUMPTION » 1000 W; SERVICE: 

A. 105 - 125 VOLTS, 50 - 60 HERTZ 3 20 AMPS. (VOLTAGE OPTION "A" HAS A WHITE WIRE IN THE PAN POWER CABLE.) 

B. '88 - 223 VOLTS 50 - 60 HERTZ 3 10 AMPS. (VOLTAGE OPTION "B" HAS A BLUE WIRE IN THE FAN POWER CABLE.) 

C. 210 - 250 VOLTS, 50 - 60 HERTZ a 10 AMPS. (VOLTAGE OPTION 'C" HAS A RED WIRE IN THE FAN POWER CABLt.) 

D. LOW IMPEDANCE SERVICE AOVISED. 

3) SPACE: 

'HEIGHT: WITH CONTROL PANEL AT THE FRONT; 24V (62.23 CM). 
WITH CONTROL PANEL AT T HE REAR; 22k" (57.79 CM). 
WIDTH: 29" (48.26 CM). 
DEPTH: 20 - 25" (50.30 - 63.50 CM). 

CAUTION - ALLOW AT LEAST 1.75" OF FREE AIR SPACE ABOVE THE AP IF USED AS SHOWN. IF THE CONTROL PANEL IS 10VED, 
ALLOW 1.75" OF FREE AIR SPACE 3EL0W THE AP. 



NOTE: THE =OWER PANEL TO AP POWER CABLE IS LOCATED ON THE lOWER RIGHT SIDE (NOT SHOWN). 



Figure 1-2 AP Physical Configuration 
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1.5 SOFTWARE 

Four software packages can be supplied with the AP which assist the 
user toward the solution of the particular processing task. 



1.5.1 APEX (AP EXECUTIVE) 

APEX is a mechanism for communicating with the AP via a series of 
FORTRAN or machine language subroutine calls. The executive driver 
routine interprets the particular user call and directs the AP to 
perform the specified action. For example, in FORTRAN, to load an 
array A containing N real data points into the AP and perform a real 
fast fourier transform upon that data: 



IA-0 

CALL APPUT (A,IA,N,2) 

CALL RFFT (IA,N,1) 



Both the standard applications subroutines described below and 
user-developed AP programs may be called from the host computer using 
APEX. 



1.5.2 APMATH (AP MATH -LIBRARY) 

There are 239 subroutines written in AP assembly language. They are 
callable from the host computer FORTRAN or machine language using APEX. 
They are listed in Table 1-3. 
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1.5.3 PROGRAM DEVELOPMENT PACKAGE 

Six FORTRAN IV programs, which are compiled on the host computer during 
installation, aid user program development. 

These are: 



APAL 



AP assembly language. Cross-assembler 
which provides a two-pass assembly of 
symbolic coding into an object module. 
APAL generates detailed error diagnostics. 



APLOAD 



AP loader. Links and relocates 
separate APAL and AP-FORTRAN object 
modules together. It produces a load 
module and a host FORTRAN subroutine 
which transfers the load module to the 
AP. 



APDBUG 



AP debugger. Interactive debugging 
program. The user may selectively set 
breakpoints, examine and change memory, 
and register contents and run program 
segments . 



APSIM 



AP simulator. Called by APDBUG, APSIM 
provides a programmed simulation of the 
various hardware elements of the AP. 
All timing characteristics of the AP 
are emulated and the floating-point 
arithmetic is simulated (including 
rounding) to the least significant bit. 
APSIM is a convenient tool in bringing 
up new AP programs off-line without 
interf erring with production runs. 



VFC 



Vector Function Chainer. A translator 
to convert VFC syntax to AP assembly 
language (APAL). It consolidates 
multiple CALLS to the AP from the host 
computer into one CALL whenever possible. 
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AP-FORTRAN 



Array processor FORTRAN. A compiler 
which allows FORTRAN subprograms to 
execute on the AP. The compiler produces 
object modules which are used as input to 
the AP loader (APLOAD). 



1.5.4 APTEST (AP TEST PROGRAMS) 

APTEST is a collection of interactive diagnostic tests and verify 
programs which aid in isolation of hardware faults. 

These are: 



APTEST 



AP tester. Exercises the panel, DMA 
interface, and various internal registers 
and memories. Tests main data memory with 
simple patterns and then with random 
numbers. Board-level diagnostic indicators 
are provided. 



APPATH 



AP path tester. Tests the various 
internal data paths and gives board 
level diagnostics. 



APARTH 



AP arithmetic test. Tests the 
floating-point adder, multiplier, and 
s-pad arithmetic unit with pseudorandom 
number and operation sequences. 



FIFFT 



Forward /inverse FFT test. Verifies 

the correct operation of the AP 

as a complete unit by doing 

forward /inverse FFT transforms on both 

spikes and random number sequences . 
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Table 1-1 Floating-Point Arithmetic Times 



OPERATION 


TRAVEL TIME 


PIPELINE INTERVAL 


Add/Subtract 


2.333 us 


0.167 us 


Multiply 


0.500 us 


0.167 us 


Multi ply-Add 


0.833 us 


0.167 us 


Complex Add/Subtract 


0.530 us 


0.333 us 


Complex Multiply 


1.333 us 


0.667 us 


Complex Multiply-Add 


1.667 us 


0.667 us 



0983 



Travel time is the total time required to get from the data source to 
the destination including the full transport through the arithmetic 
units. Pipeline interval is the time between successively available 
resultants. The former is important when the successive arguments of a 
computation depend on previous calculations. The latter is indicative 
of the maximum throughput rate available for successively independent 
calculators. 
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Table 1-2 Basic Scalar Functions 



OPERATION 


TYPICAL EXECUTION 
TIME/LOOP Cus) 


PROGRAM SIZE 
(AP PS WORDS) 




167 ns 


333 ns 


157 ns 


333 ns 


Divide 


3.8 


3.8 


28 


23 


Square Root 


3.8 


3.8 


28 


28 


Exponential 


4.2 


4.2 


28 


28 


Natural Logarithm 


4.0 


4.0 


37 


37 


Base 10 Logarithm 


4.7 


4.7 


37 


37 


Sine 


4.9 


4.9 


35 


35 


Cosine 


5.4 


5.4 


35 


35 


Arctangent 


8.7 


8.7 


74 


74 


Arctangent of (Y/Xl 


13.3 


13.8 


74 


74 



0984 



These functions take arguments from data pad and return full-word 
accuracy results to data pad. Full-precision polynomial coefficients 
for these functions are contained on the standard 512 words of table 
memory . 
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Table 1-3 Summary of AP FORTRAN Callable Routines 









Typ: 


Leal 


Program 








Execution 


Size 




Name 


Operation 


Time, 


/Loop 


(AP 










(us) 


PS wor 


ds) 




167 


! 333 


167 | 


333 




DATA TRANSFER AND CONTROL OPERATIONS (APEX) 






APPUT 


PUT DATA INTO THE AP 


#.# 


#.# 








APGET 


GET DATA FROM THE AP 


#.# 


#.// 








APCLR 


INITIALIZE THE AP 


#.# 


#.# 








APWD 


WAIT FOR AP DATA TRANSFER 


#.# 


#.# 








APWR 


WAIT FOR AP PROGRAM EXECUTION 


#.# 


#.# 








APWAIT 


WAIT FOR AP 


#.# 


#.# 








APGSP 


READ AN 


AP S-PAD REGISTER 


#.# 


#.# 








APCHK 


CHECK AP PROGRAM ERROR CONDITION 


#.# 


#.# 








APSTAT 


GET AP 


HARDWARE STATUS 


#.# 


#.# 










BASIC VECTOR ARITHMETIC 










VCLR 


VECTOR 


CLEAR 


0.2 


0.3 


16 


4 


VMOV 


VECTOR 


MOVE 


0.5 


0.8 


16 


6 


VSWAP 


VECTOR 


SWAP 


1.2 


1.5 


21 


12 


WILL 


VECTOR 


FILL 


0.3 


0.3 


5 


5 


VRAMP 


VECTOR 


RAMP 


0.3 


0.3 


12 


12 


VNEG 


VECTOR 


NEGATE 


0.5 


0.8 


18 


7 


VADD 


VECTOR 


ADD 


0.8 


1.3 


20 


8 


VSUB 


VECTOR 


SUBTRACT 


0.8 


1.3 


20 


8 


VMUL 


VECTOR 


MULTIPLY 


0.8 


1.3 


20 


11 


VDIV 


VECTOR 


DIVIDE 


1.7 


1.7 


75 


75 


VSADD 


VECTOR 


SCALAR ADD 


0.5 


0.8 


19 


8 


VSMUL 


VECTOR 


SCALAR MULTIPLY 


0.5 


0.8 


20 


9 


VTSADD 


VECTOR 


TABLE SCALAR ADD 


0.5 


0.8 


8 


8 


VTSMUL 


VECTOR 


TABLE SCALAR MULTIPLY 


0.5 


0.8 


8 


8 


VSQ 


VECTOR 


SQUARE 


0.5 


0.8 


9 


9 


VSSQ 


VECTOR 


SIGNED SQUARE 


0.5 


0.8 


21 


9 


VABS 


VECTOR 


ABSOLUTE VALUE 


0.5 


0.8 


17 


7 


VSQRT 


VECTOR 


SQUARE ROOT 


1.8 


1.8 


79 


79 


VLOG 


VECTOR 


LOGARITHM (BASE 10) 


2.7 


2.7 


54 


58 


VLN 


VECTOR 


NATURAL LOGARITHM 


2.7 


2.7 


42 


42 
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Table 1-3 Summary of AP FORTRAN Callable Routines (cont-) 



Name 



Operation 



VALOG 


VECTOR 


VEXP 


VECTOR 


VSIN 


VECTOR 


VCOS 


VECTOR 


VATAN 


VECTOR 


VATN2 


VECTOR 


VRAND 


VECTOR 


VMSA 


VECTOR 


VSMA 


VECTOR 


VSMSB 


VECTOR 


VMA 


VECTOR 


VMSB 


VECTOR 


VAM 


VECTOR 


VSBM 


VECTOR 


VSMSA 


VECTOR 


VMMA 


VECTOR 


VMMSB 


VECTOR 


VAAM 


VECTOR 


VSBSBM 


VECTOR 


VAND 


VECTOR 


VEQV 


VECTOR 


VOR 


VECTOR 


VFRAC 


VECTOR 


VINT 


VECTOR 


VINDEX 


VECTOR 



ANTILOGARITHM (BASE 10) 

EXPONENTIAL 

SINE 

COSINE 

ARCTANGENT 

ARCTANGENT OF Y/X 

RANDOM NUMBERS 

MULTIPLY AND SCALAR ADD 

SCALAR MULTIPLY AND ADD 

SCALAR MULTIPLY AND SUBTRACT 

MULTIPLY AND ADD 

MULTIPLY AND SUBTRACT 

ADD AND MULTIPLY 

SUBTRACT AND MULTIPLY 

SCALAR MULTIPLY AND SCALAR ADD 

MULTIPLY, MULTIPLY, AND ADD 

MULTIPLY MULTIPLY AND SUBTRACT 

ADD, ADD, AND MULTIPLY 

SUBTRACT SUBTRACT AND MULTIPLY 

LOGICAL AND 

LOGICAL EQUIVALENCE 

LOGICAL OR 

TRUNCATE TO FRACTION 

TRUNCATE TO INTEGER 

INDEX 



Typ 


ical 


Prog: 


ram 


Execution 


Size 


Time 


/Loop 


(AP 


( 


us) 


PS words) 


167 


| 333 


167 | 


333 


2.3 


2.3 


58 


58 


2.3 


2.3 


55 


55 


1.3 


1.3 


34 


34 


1.3 


1.3 


34 


34 


9.7 


9.8 


87 


87 


14.2 


14.2 


88 


88 


1.2 


1.2 


16 


16 


0.8 


1.3 


23 


14 


0.8 


1.3 


21 


14 


0.8 


1.3 


21 


14 


1.2 


1.8 


23 


15 


1.2 


1.8 


23 


15 


1.2 


1.8 


23 


14 


1.2 


1.8 


23 


14 


0.5 


0.8 


23 


15 


1.5 


2.3 


27 


19 


1.5 


2.3 


27 


19 


1.5 


2.3 


13 


20 


1.5 


2.3 


13 


20 


0.8 


1.3 


20 


8 


0.8 


1.3 


20 


8 


0.8 


1.3 


20 


8 


0.7 


0.8 


13 


13 


0.5 


0.8 


9 


9 


0.8 


1.3 


28 


26 



VECTOR-TO-SCALAR OPERATIONS 



SVE SUM OF VECTOR ELEMENTS 

SVEMG SUM OF VECTOR ELEMENT MAGNITUDES 

SVESQ SUM OF VECTOR ELEMENT SQUARES 

SVS SUM OF VECTOR SIGNED SQUARES 

DOTPR DOT PRODUCT 

MAXV MAXIMUM ELEMENT IN VECTOR 

MINV MINIMUM ELEMENT IN VECTOR 

MAXMGV MAXIMUM MAGNITUDE ELEMENT IN VECTOR 

MINMGV MINIMUM MAGNITUDE ELEMENT IN VECTOR 



0.3 


0.3 


7 


7 


0.3 


0.3 


10 


10 


0.3 


0.3 


10 


10 


0.3 


0.3 


11 


11 


0.5 


0.8 


21 


9 


0.3 


0.3 


19 


19 


0.3 


0.3 


19 


19 


0.3 


0.3 


19 


19 


0.3 


0.3 


19 


19 
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Table 1-3 Summary of AP FORTRAN Callable Routines (cont.) 



Name 



Operation 



MEANV MEAN VALUE OF VECTOR ELEMENTS 

MEAMGV MEAN OF VECTOR ELEMENT MAGNITUDES 

MEASQV MEAN OF VECTOR ELEMENT SQUARES 

RMSQV ROOT-MEAN-SQUARE OF VECTOR ELEMENTS 



Typical 


Program 


Execution 


Size 


Time /Loop 


(AP 


(us) 


PS words) 


167 | 333 


167 | 333 


0.3 0.3 


49 49 


0.3 0.3 


52 52 


0.3 0.3 


52 52 


0.3 0.3 


81 81 



VECTOR COMPARISON OPERATIONS 



VMAX VECTOR MAXIMUM 

VMIN VECTOR MINIMUM 

VMAXMG VECTOR MAXIMUM MAGNITUDE 

VMINMG VECTOR MINIMUM MAGNITUDE 

VCLIP VECTOR CLIP 

VICLIP VECTOR INVERTED CLIP 

VLIM VECTOR LIMIT 

LVGT LOGICAL VECTOR GREATER THAN 

LVGE LOGICAL VECTOR GREATER THAN OR EQUAL 

LVEQ LOGICAL VECTOR EQUAL 

LVNE LOGICAL VECTOR NOT EQUAL 

LVNOT LOGICAL VECTOR NOT 

VLMERG VECTOR LOGICAL MERGE 



0.8 


1.3 


22 


13 


0.8 


1.3 


22 


13 


0.8 


1.3 


14 


14 


0.8 


1.3 


14 


14 


0.5 


0.8 


16 


16 


0.7 


0.8 


19 


19 


0.5 


0.8 


14 


14 


0.8 


1.3 


23 


13 


0.8 


1.3 


23 


13 


0.8 


1.3 


23 


13 


0.8 


1.3 


23 


13 


0.5 


0.8 


21 


12 


0.8 


1.5 


23 


16 



COMPLEX VECTOR ARITHMETIC 



CVMOV COMPLEX VECTOR MOVE 

CVFILL COMPLEX VECTOR FILL 

CVCOMB COMPLEX VECTOR COMBINE 

CVREAL FORM COMPLEX VECTOR OF REALS 

VREAL EXTRACT REALS OF COMPLEX VECTOR 

VIMAG EXTRACT IMAGINARIES OF COMPLEX VECTOR 

CVNEG COMPLEX VECTOR NEGATE 

CVCONJ COMPLEX VECTOR CONJUGATE 

CVADD COMPLEX VECTOR ADD 

CVSUB COMPLEX VECTOR SUBTRACT 



0.8 


1.3 


9 


9 


0.5 


0.7 


8 


8 


1.1 


1.7 


10 


10 


0.8 


1.2 


9 


9 


0.5 


0.8 


17 


7 


0.5 


0.8 


18 


8 


0.8 


1.3 


11 


11 


0.7 


1.3 


10 


12 


1.0 


2.0 


13 


12 


1.0 


2.0 


13 


12 
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Table 1-3 Summary of AP FORTRAN Callable Routines (cont.) 



Name 



Operation 



CVMUL 

CVSMUL 

CVRCIP 

CRVADD 

CRVSUB 

CRVMUL 

CRVDIV 

CVMA 

CVMAGS 

SCJMA 

POLAR 

RECT 

CVEXP 

CVMEXP 

CDOTPR 



COMPLEX VECTOR MULTIPLY 
COMPLEX VECTOR SCALAR MULTIPLY 
COMPLEX VECTOR RECIPROCAL 
COMPLEX AND REAL VECTOR ADD 
COMPLEX AND REAL VECTOR SUBTRACT 
COMPLEX AND REAL VECTOR MULTIPLY 
COMPLEX AND REAL VECTOR DIVIDE 
COMPLEX VECTOR MULTIPLY AND ADD 
COMPLEX VECTOR MAGNITUDE SQUARED 
SELF-CONJUGATE* MULTIPLY AND ADD 
RECTANGULAR TO POLAR CONVERSION 
POLAR TO RECTANGULAR CONVERSION 
COMPLEX VECTOR EXPONENTIAL 
VECTOR MULTIPLY COMPLEX EXPONENTIAL 
COMPLEX DOT PRODUCT 



Typ 


ical 


Pre 


)gram 


Execution 


Size 


Time 


/Loop 


( 


:ap 


( 


us) 


PS words) 


167 


I 333 


167 


1 333 


1.0 


2.0 


25 


26 


0.8 


1.3 


12 


12 


5.2 


5.2 


50 


50 


1.3 


1.8 


14 


14 


1.3 


1.8 


14 


14 


1.3 


1.8 


14 


14 


3.3 


3.3 


92 


92 


1.3 


2.7 


29 


30 


0.7 


1.2 


13 


18 


0.8 


1.5 


14 


15 


19.5 


19.5 


120 


120 


2.3 


2.3 


49 


49 


2.0 


2.0 


43 


43 


2.3 


2.3 


48 


48 


0.7 


1.3 


15 


16 



DATA FORMATTING OPERATIONS 



VFLT 


VECTOR 


VFIX 


VECTOR 


VSMAFX 


VECTOR 


VSCALE 


VECTOR 


VSCSCL 


VECTOR 


VSHFX 


VECTOR 


VUP8 


VECTOR 


VUPS8 


VECTOR 


VPK8 


VECTOR 


VUP16 


VECTOR 


VUPS16 


VECTOR 


VPK16 


VECTOR 


VFLT 3 2 


VECTOR 


VFIX32 


VECTOR 


VSEFLT 


VECTOR 



INTEGER FLOAT 

INTEGER FIX 

SCALAR MULTIPLY, ADD, AND FIX 

SCALE (POWER 2) AND FIX 

SCAN, SCALE (POWER 2) AND FIX 

SHIFT AND FIX 

8-BIT BYTE UNPACK 

8-BIT SIGNED BYTE UNPACK 

8-BIT BYTE PACK 

16-BIT BYTE UNPACK 

16-BIT SIGNED BYTE UNPACK 

16-BIT BYTE PACK 

32-BIT INTEGER FLOAT 

32-BIT INTEGER FIX 

SIGN EXTEND AND FLOAT 



0.5 


0.8 


13 


11 


0.7 


0.8 


18 


7 


0.7 


0.8 


14 


13 


0.7 


0.8 


12 


12 


1.5 


1.7 


19 


19 


0.7 


0.8 


9 


9 


0.5 


0.5 


71 


71 


0.9 


0.9 


107 


107 


0.9 


0.9 


65 


65 


0.8 


0.8 


61 


61 


1.3 


1.3 


58 


58 


0.8 


0.8 


46 


46 


1.7 


1.7 


65 


65 


1.2 


1.2 


33 


33 


0.8 


0.8 


15 


15 
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Table 1-3 Summary of AP FORTRAN Callable Routines (cont.) 



Name 



Operation 



Typical 


Program 


Execution 


Size 


Time /Loop 


(AP 


(us) 


PS words) 



167 



333 



167 I 333 



MATRIX OPERATIONS 



MTRANS MATRIX TRANSPOSE 

MMUL MATRIX MULTIPLY 

MMUL32 MATRIX MULTIPLY (DIMENSION <=>32) 

MATINV MATRIX INVERSE 

SOLVEQ LINEAR EQUATION SOLVER 

MVML3 MATRIX VECTOR MULTIPLY (3X3) 

MVML4 MATRIX VECTOR MULTIPLY (4X4) 

CTRN3 3-DIMENSION COORDINATE TRANSFORMATION 

FMMM FAST MEMORY MATRIX MULTIPLY 

FMMM32 FAST MEMORY MATRIX MULTIPLY (<=32) 



0.5 0.9 


18 


22 


0.62* 0.83 


59 


59 


0.50* 0.73 


27 


27 


1.6 * 2.1 


160 


160 


0.7 * 0.9 


216 


222 


2.0 * 2.2 


30 


30 


3.3 * 3.8 


39 


39 


2.3 * 2.5 


37 


37 


0.43* 


61 




0.41* 


33 





FFT OPERATIONS 



CFFT COMPLEX TO COMPLEX FFT (IN PLACE) 

CFFTB COMPLEX TO COMPLEX FFT (NOT IN PLACE) 

RFFT REAL TO COMPLEX FFT (IN PLACE) 

RFFTB REAL TO COMPLEX FFT (NOT IN PLACE) 

CFFTSC COMPLEX FFT SCALE 

RFFTSC REAL FFT SCALE AND FORMAT 

CFFT 2D COMPLEX TO COMPLEX 2-DIMENSIONAL FFT 

RFFT2D REAL TO COMPLEX 2-DIMENSIONAL FFT 



0.28* 


0.40 


186 


184 


0.20* 


0.28 


189 


189 


0.18* 


0.27 


253 


251 


0.14* 


0.20 


252 


252 


0.8 


1.3 


42 


42 


0.7 


0.8 


59 


59 


0.5 * 


0.5 


274 


274 


0.4 * 


0.4 


585 


585 



AUXILIARY OPERATIONS 



CONV CONVOLUTION (CORRELATION) 0.28* 0.28 106 106 

DEQ22 DIFFERENCE EQUATION, 2 POLES, 2 ZEROS 0.8 0.8 25 25 

VPOLY VECTOR POLYNOMIAL EVALUATION 1.0 * 1.2 41 41 

VSUM VECTOR SUM OF ELEMENTS INTEGRATION 0.7 0.8 13 13 
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Table 1-3 Summary of AP FORTRAN Callable Routines (cont.) 



Name 



Operation 



VTRAPZ VECTOR TRAPEZOIDAL RULE INTEGRATION 
VSIMPS VECTOR SIMPSONS 1/3 RULE INTEGRATION 
WIENER WIENER LEVINSON ALGORITHM 



Typical 


Program 


Execution 


Size 


Time /Loop 


(AP 


(us) 


PS words) 


167 | 333 


167 | 333 


0.7 0.8 


16 16 


0.7 0.8 


25 25 


0.50* 0.65 


100 100 



SIGNAL PROCESSING OPERATIONS (optional) 



HIST HISTOGRAM 

HANN HANNING WINDOW MULTIPLY 

ASPEC ACCUMULATING AUTO-SPECTRUM 

CSPEC ACCUMULATING CROSS -SPECTRUM 

VAVLIN VECTOR LINEAR AVERAGING 

VAVEXP VECTOR EXPONENTIAL AVERAGING 

VDBPWR VECTOR CONVERSION TO DB (POWER) 

TRANS TRANSFER FUNCTION 

COHER COHERENCE FUNCTION 

ACORT AUTO-CORRELATION (TIME-DOMAIN) 

ACORF AUTO-CORRELATION (FREQUENCY-DOMAIN) 

CCORT CROSS -CORRELATION (TIME-DOMAIN) 

CCORF CROSS -CORRELATION (FREQUENCY-DOMAIN) 

TCONV POSTTAPERED CONVOLUTION (CORRELATION) 



1.3 


1.4 


71 


71 


0.7 


0.8 


41 


41 


0.8 


1.5 


21 


22 


1.3 


2.7 


39 


40 


0.8 


1.3 


54 


46 


0.8 


1.3 


55 


46 


1.2 


1.3 


75 


75 


3.3 


3.3 


100 


100 


4.0 


4.5 


109 


114 


0.29* 


0.29 


121 


121 


1.80* 


2.70 


501 


489 


0.29* 


0.29 


121 


121 


2.58* 


3.93 


526 


510 


0.30* 


0.30 


112 


112 



FPS 860-7259-003 



1 - 19 



Table 1-3 Summary of AP FORTRAN Callable Routines (cont.) 



Name 



Operation 



Typical 
Execution 
Time /Loop 
(us) 



Program 
Size 
(AP 
PS words) 



167 | 333 167 | 333 



TABLE MEMORY OPERATIONS (optional) 



MTMOV VECTOR MOVE (MD TO TM) 

TMMOV VECTOR MOVE (TM TO MD) 

MTIMOV VECTOR MOVE WITH INCREMENT (MD TO TM) 

TMIMOV VECTOR MOVE WITH INCREMENT (TM TO MD) 

TTIMOV VECTOR MOVE WITH INCREMENT (TM TO TM) 

MMTADD VECTOR ADD (MD+MD TO TM) 

MMTSUB VECTOR SUBTRACT (MD-MD TO TM) 

MMTMUL VECTOR MULTIPLY (MD*MD TO TM) 

MTMADD VECTOR ADD (MD+TM TO MD) 

MTMSUB VECTOR SUBTRACT (MD-TM TO MD) 

TMMSUB VECTOR SUBTRACT (TM-MD TO MD) 

MTMMUL VECTOR MULTIPLY (MD*TM TO MD) 

MTTADD VECTOR ADD (MD+TM TO TM) 

MTTSUB VECTOR SUBTRACT (MD-TM TO TM) 

TMTSUB VECTOR SUBTRACT (TM-MD TO TM) 

MTTMUL VECTOR MULTIPLY (MD*TM TO TM) 

TTMADD VECTOR ADD (TM+TM TO MD) 

TTMSUB VECTOR SUBTRACT (TM-TM TO MD) 

TTMMUL VECTOR MULTIPLY (TM*TM TO MD) 

TTTADD VECTOR ADD (TM+TM TO TM) 

TTTSUB VECTOR SUBTRACT (TM-TM TO TM) 

TTTMUL VECTOR MULTIPLY (TM*TM TO TM) 



0.2 


0.3 


6 


7 


0.2 


0.3 


5 


5 


0.5 


0.5 


7 


7 


0.3 


0.3 


15 


15 


0.5 


0.5 


7 


7 


0.7 


0.8 


20 


13 


0.7 


0.8 


20 


13 


0.7 


0.8 


20 


13 


0.5 


0.8 


20 


9 


0.5 


0.8 


20 


9 


0.5 


0.8 


20 


9 


0-5 


0.8 


20 


9 


0.5 


0.5 


20 


20 


0.5 


0.5 


20 


20 


0.5 


0.5 


20 


20 


0.5 


0.5 


20 


20 


0.5 


0.5 


20 


20 


0.5 


0.5 


20 


20 


0.5 


0.5 


20 


20 


0.7 


0.7 


9 


9 


0.7 


0.7 


9 


9 


0.7 


0.7 


10 


10 
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Table 1-3 Summary of AP FORTRAN Callable Routines (cont.) 



Name 



Operation 



Typical 


Program 


Execution 


Size 


Time /Loop 


(AP 


(us) 


PS words) 



167 



333 



167 I 333 



APAL-CALLABLE UTILITY OPERATIONS 



DIV SCALAR DIVIDE 

SQRT SCALAR SQUARE ROOT 

LOG SCALAR LOGARITHM (BASE 10) 

LN SCALAR NATURAL LOGARITHM 

EXP SCALAR EXPONENTIAL 

SIN SCALAR SINE 

COS SCALAR COSINE 

ATAN SCALAR ARCTANGENT 

ATN2 SCALAR ARCTANGENT OF Y/X 

SP.FLT FLOAT S-PAD INTEGER 

SPUFLT S-PAD UNSIGNED FLOAT 

SPNEG S-PAD NEGATE 

SPADD S-PAD ADD 

SPSUB S-PAD SUBTRACT 

SPMUL S-PAD MULTIPLY 

SPDIV S-PAD DIVIDE 

SPRS S-PAD RIGHT SHIFT 

SPLS S-PAD LEFT SHIFT 

SPAND S-PAD AND 

SPOR S-PAD OR 

SPNOT S-PAD NOT 

SAVESP SAVE S-PAD INTO PROGRAM MEMORY 

SAVSPO SAVE S-PAD INTO PROGRAM MEMORY 

SETSP LOAD S-PADS FROM PROGRAM MEMORY 

SET2SP LOAD 2 S-PADS FROM PROGRAM MEMORY 

MDCOM MAIN DATA COMPARE AND SET S-PAD 

ZMD CLEAR ALL PAGES OF MAIN DATA MEMORY 

RDC5 READ CONTROL BIT 5 INTERRUPT 

SETC5 SET CONTROL BIT 5 INTERRUPT 

DAREAD READ DEVICE ADDRESS REGISTER 

DAWRIT WRITE DEVICE ADDRESS REGISTER 

VFCL1 VECTOR FUNCTION CALLER (1 ARGUMENT) 

VFCL2 VECTOR FUNCTION CALLER (2 ARGUMENT) 

BITREV COMPLEX VECTOR BIT REVERSE ORDERING 

REALTR REAL FFT UNRAVEL AND FINAL PASS 

FFT2 RADIX 2 FFT FIRST PASS 



3.8 


6 


3.8 


28 


28 


3.8 


6 


3.8 


28 


28 


4.7 


6 


4.7 


37 


37 


4.0 


@ 


4.0 


37 


37 


4.2 


@ 


4.2 


28 


28 


4.9 


@ 


4.9 


35 


35 


5.4 


@ 


5.4 


35 


35 


8.7 


@ 


8.7 


74 


74 


13.8 


613.8 


74 


74 


0.8 


@ 


0.8 


5 


5 


0.8 


6 


0.8 


8 


8 


0.3 


6 


0.3 


2 


2 


0.2 


6 


0.2 


1 


1 


0.2 


@ 


0.2 


1 


1 


2.3 


@ 


2.3 


14 


14 


6.2 


@ 


6.2 


43 


43 


0.3 


* 


0.3 


5 


5 


0.3 


* 


0.3 


5 


5 


0.2 


6 


0.2 


1 


1 


0.2 


6 


0.2 


1 


1 


0.2 


6 


0.2 


1 


1 


0.8 


* 


0.8 


18 


18 


2.0 


* 


2.0 


11 


11 


2.3 


* 


2.3 


33 


33 


5.7 


6 


5.7 


33 


33 


1.8 


6 


2.0 


11 


11 


0.2 




0.3 


29 


29 


1.5 


@ 


1.5 


9 


9 


0.2 


6 


0.2 


1 


1 


0.3 


6 


0.3 


2 


2 


0.3 


6 


0.3 


2 


2 


0.8 




1.0 


10 


10 


1.0 




1.0 


11 


11 


0.9 




1.4 


45 


43 


0.4 




0.7 


68 


68 


1.3 




2.7 


16 


16 
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Table 1-3 Summary of AP FORTRAN Callable Routines (cont.) 



Name 



Operation 



FFT4 

FFT2B 

FFT4B 

STSTAT 

CLSTAT 

IL0G2 

ADV2 

ADV4 

SET24B 

XCFFT 

XRFFT 

XBITRE 

XREALT 

PCFFT 

XFFT4 

CTOR 

RTOC 

SSDA 

SSDM 

SDDA 

DDDA 

DDDM 



RADIX 4 FFT PASS 

RADIX 2 FFT FIRST PASS + BIT REVERSE 

RADIX 4 FFT FIRST PASS + BIT REVERSE 

SET FFT MODE STATUS BITS 

CLEAR FFT MODE STATUS BITS 

LOGARITHM (BASE 2) 

ADVANCE POINTERS AFTER RADIX 2 FFT 

ADVANCE POINTERS AFTER RADIX 4 FFT 

SETUP FOR FFT2B AND FFT4B 

EXPANDED COMPLEX FFT 

EXPANDED REAL FFT 

EXPANDED BIT REVERSE 

EXPANDED REAL FFT FINAL PASS 

PARTIAL COMPLEX FFT 

EXPANDED RADIX 4 FFT PASS 

COMPLEX TO REAL FFT UNSCRAMBLE 

REAL TO COMPLEX FFT SCRAMBLE 

SINGLE + SINGLE TO DOUBLE ADD 

SINGLE * SINGLE TO DOUBLE MULTIPLY 

SINGLE + DOUBLE TO DOUBLE ADD 

DOUBLE + DOUBLE TO DOUBLE ADD 

DOUBLE * DOUBLE TO DOUBLE MULTIPLY 



Typical 
Execution 
Time /Loop 
(us) 



3.7 

1.3 

2.7 

5.0 

0.5 

4.0 

0.7 

0.7 

1.2 

0.32 

0.19 

3.7 

0.4 

1.05 

3.7 

0.13 

0.19 

1.5 

11.5 
4.5 
7.5 

18.5 



5.3 
2.7 
5.3 
@ 5.0 
<a 0.5 
@ 4.0 
@ 0.7 
@ 0.7 
@ 1.2 

* 0.42 

* 0.28 
3.7 
0.7 

* 1.50 
5.3 

* 0.13 

* 0.09 
@ 1.5 
@11.5 
@ 4.5 
@ 7.5 
(§18.5 



Program 
Size 
(AP 
PS words) 



167 | 333 167 



79 

25 

43 

19 

19 

19 

7 

7 

8 

187 

256 
44 
71 

117 
79 
80 

143 
10 
81 
28 
48 

117 



I 333 

79 

25 

43 

19 

19 

19 

7 

7 

8 

187 

256 
44 
71 

117 
79 
80 

143 
10 
81 
28 
48 

117 



NOTE 

#.# Timing host system dependent 

* Refer to description of routine for explanation of timing 

@ Total execution time 
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Table 1-4 Convolution (Correlation) 



ELEMENT 


COUNTS 


TYPICAL EXECUTION 
TIME/LOOP Oisl 


M 


N 


167ns 


333ns 


3 


128 


0.28 


3.28 


32 


128 


0.83 


0.83 


128 


128 


3.0 


3.3 


8 


1024 


2.3 


2.3 


32 


1024 


6.6 


6.6 


128 


1024 


' 24. Q 


24.Q 


1024 


1024 


186.2 


186.2 
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Table 1-5 Fast Fourier Transforms 





RFFT 


RFFTB 


CFFT 


CFF 


TB 


POINTS 


167ns 


333ns 


167ns 


333ns 


167ns 


333ns 


167ns 


333ns 


64 


0.13 


0.27 


0.14 


0.20 


0.28 


0.40 


0.20 


0.23 


128 


0.35 


0.50 


0.27 ■ 


0.38 


0.62 


0.95 


0.47 


0.72 


256 


0.74 


1.13 


0.58 


0.90 


1.28 


1.86 


0.97 


1.41 


512 


1.50 


2.22 


1.20 


1.76 


2.86 


4.33 


2.26 


3.48 


1024 


3.30 


5.08 


2.70. 


4. IS 


5.95 


8.73 


4.75 


5.93 


2048 


6.81 


10.12 


5.61 


8.32 


13.32 


20. 1Q 


10.33 


16.60 


4096 


14.95 


22.96 


12.56 


19.37 


27.44 


40.33 


22.56 


33.16 


3192 


30.88 


45.86 


. 26.09 


38.69 


60.33 


91.56 


50.76 


77.31 


16384 


67.19 


102.70 


57.63 


88,36 


124.70 


183.27 


105.58 


154.59 


32768 


138.42 


205.35 


119.30 


176.68 


. ... • 


" 


— 


-- 
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CHAPTER 2 



FUNCTIONAL DESCRIPTION 



2.1 INTRODUCTION 

The hardware of the AP is composed of the following three types of 
functional elements: 



• logical and control elements 

control unit 
s-pad unit 



• floating-point arithmetic elements 

floating-point adder 
floating-point multiplier 



• memory elements 



data pad unit 

main data memory unit 

table memory unit 



Each of these functional units is independent and thus can 
independently perform the programmed operations for which it was 
designed in parallel with the other functional units. 
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2.2 CONTROL UNIT 

The control unit, as illustrated by Figure 2-1, consists of: 

• program source memory (PS) 

• program source address (PSA) register 

• control buffer (CB) with decoding logic 

• subroutine return stack (SRS) 



The operation of the AP is controlled by the execution of 64-bit 
instruction words which reside in program source (PS) memory. The 
program word for the next instruction to be performed is selected by 
the address in the program source address (PSA) register. At the 
initiation of the next machine cycle, this program word is transferred 
to the control buffer (CB) where it is decoded and executed. The PSA 
is incremented by one unless a branch in the current instruction causes 
the PSA to move to another location in program source memory. Access 
to program source memory and instruction decoding is overlapped so that 
the AP can operate at a 6-MHz rate (167ns). 

Branching is accomplished in two ways. A short-range branch is 
provided by adding the 5-bit branch displacement field to the current 
PSA. This gives a branch range of from -20s to +17&. A long-range 
jump to any location in PS is accomplished by loading the desired 
target address into PSA. 

Subroutine jumps are made by a JSR instruction which saves the current 
PSA in the subroutine return stack, and sets PSA to the subroutine 
address. Return is via a return, which loads the PSA with the last 
entered return address on the SRS. 

Subroutine return address (SRA) is the subroutine return stack pointer, 
which is automatically incremented or decremented as subroutines are 
called and returns are made from the subroutine. 
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sra 



1 



SUBROUTINE 
RETURN STACK 



; i i 



PROGRAM SOURCE ADDRESS (PSA) 



PROGRAM SOURCE 
MEMORY (PS) 



CONTROL 3UFFER (CB) 



I 
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Figure 2-1 Control Unit 



2.3 S-PAD UNIT 

This unit, illustrated by Figure 2-2, performs the integer address 
indexing, loop counting, and control functions necessary to direct 
completion of a given algorithm. In form, it is similar to familiar 
minicomputers such as the PDP-11 and Nova. 



The s-pad contains sixteen 16-bit directly-addressable registers, 
contents of these registers pass through a special integer ALU 
associated with this unit. 



The 



The output of the ALU may be directed back, to the specified s-pad 
destination register and also may be directed to any of the following 
address memory registers: memory address (MA), table memory address 
(TMA), or data pad address (DPA). 
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The s-pad integer ALU functions include the following: 



function 



move 



logical complement 

clear 

increment 

decrement 

add 

subtract 

logical AND 

logical OR 

logical equivalence 



effect 

S — ► D S-source register 

S — ► D D-destination register 

— ►D 

S+l — ►D 

S-l — ►D 

D+S — ►D 

D-S — ►D 

D AND S ►D 

D OR S ►D 

D EQV S ►D 



The output of the s-pad ALU (called S-PAD FUNCTION or SPFN) may be used 
unmodified, shifted left once, shifted right once, or shifted right 
twice. 

A hardware bit-reverse function included in the s-pad accomplishes the 
bit swapping necessary to access data in scrambled order after an FFT. 

The s-pad ALU also sets three condition bits in the AP status register 
depending upon the output of the ALU/shifter: 



N: set if result <0; cleared otherwise 
Z: set if result =0; cleared otherwise 
C: set if a carry occurred; cleared otherwise 



These bits may be tested by the next AP instruction, and a branch made, 
depending upon whether the specified condition is true. 
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jATA 'AD 3US (OS) 



I 



S-PAD 
REGISTER FILE 



I 



SIT 
REVERSE 




S-PAD 

ALU 
SHIFTER 



SPFN 



DATA PAD ADDRESS 



(DPA) REGISTER 



MEMORY ADDRESS (MA) REGISTER 



TABLE MEMORY ADDRESS (TMA) REGISTER 



rl 
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Figure 2-2 S-Pad Unit 
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2.4 FLOATING-POINT ADDER UNIT 

The floating-point adder, shown in Figure 2-3, performs addition or 
subtraction operations on the contents of the adder input registers (Al 
and A2). The operation is performed in two stages, each of which takes 
one machine cycle. 

In the first stage, the exponents of the two numbers are compared and 
the fractions are aligned by shifting the fraction of the smaller 
number right. The fractions are then added or subtracted. In the 
second stage, the resultant fraction is normalized and convergently 
rounded. 

Since the two stages are independent of each other, a new pair of 
numbers can be entered into Al and A2 every AP cycle (167ns). The 
result is available for use two cycles later (333ns). 

In effect, the floating adder (FA) is a pipeline where new inputs can 
be entered into the pipeline stream every cycle. Initiation of an add 
operation loads the two numbers to be added into the Al and A2 input 
registers. The previous adder input is pushed down the pipeline to the 
adder buffer register. One cycle later, the completed result (called 
FA) from the buffer is available for storage or use by another unit. 
Thus, a new add can be started every 167ns, and the result is ready 
333ns later. 

Al may be loaded from data pad (DP), from the output of the floating 
multiplier (FM), or from table memory (TM). A2 may be loaded from data 
pad (DP), from the output of the floating adder (FA), or from main data 
memory (MD) . 

The output of the floating adder (FA) may be directed to the multiplier 
(M2), to the adder (A2), to data pad (DP), or to memory input (MI). 
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The operations performed by the floating adder are: 

• A1+A2 

• A1-A2 

• A2-A1 

• Al EQV A2 

• Al AND A2 

• Al OR A2 

• convert A2 from signed magnitude to 2's complement format 

• convert A2 from 2's complement to signed magnitude format 

• scale A2 

• absolute value of A2 

• fix A2 



Four condition bits in the AP status register are set or cleared by the 
floating adder depending upon the current result: 



FZ Set to one if result is zero, else 

cleared to zero. 



FN Set to one if result is negative, else 

cleared to zero. 



FO Set to one if exponent overflow occurred. The 

result is forced to the signed maximum value. 



FU Set to one if exponent underflow occurred. 

The result is forced to zero. 
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The overflow and underflow bits remain set until cleared by the 
program. These bits may be tested by the instruction after the 
floating adder result is completed (i.e., three cycles after the 
floating adder operation is initiated). 



ERO 



OPX 



DPV 



FM 



FA 



jPX 



2PY 



mo 



ZERO 




4— r-T— r-j, 



STAGE 2 



M2 



A2 



MI 



3PX 



SPY 



Figure 2-3 Floating-Point Adder Unit 
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2.5 FLOATING-POINT MULTIPLIER UNIT 

The floating multiplier, as illustrated in Figure 2-4, forms the 
product of the two multiplier input registers (Ml and M2). The product 
is formed in three stages, each of which takes one machine cycle. 

In the first stage, the 56-bit product of the two 28-bit fractions are 
partially completed. The second stage completes the product of the 
fractions. In the third and final stage, the exponents are added, and 
the mantissa product is normalized and convergently rounded. 

The floating multiplier, like the floating adder, is organized like a 
pipeline. Initiation of a multiply loads the two numbers to be 
multiplied into the Ml and M2 input registers. The two previous 
multiplier inputs are pushed down the pipeline to buffer 2 and buffer 
3, respectively. One cycle later, the result from buffer 3 is 
available for storage or use by another unit. 

Thus, a new product can be started every 167ns, and the result is ready 
500ns later. 

Ml can be loaded from data pad (DPX or DPY), from the output of the 
floating multiplier (FM), or from table memory (TM) . M2 is loaded from 
data pad (DPX or DPY), from the adder (Al), from the multiplier (Ml), 
or from the main data memory (MD). 

Two error bits in the AP status register are affected by the floating 
multiplier: 



FO Set if exponent overflow occurred. The result 

is forced to the signed maximum value. 



FU Set if exponent underflow occurred. The result 

is forced to zero. 
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OPX 3PY TM R1 



FA DPX OPV '-ID 



"Y 



X 



Ml 
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V!2 




IT 7 



ADD EXPONENTS START PRODUCT OF FRACTIONS 



SUFFER 2 



COMPLETE PRODUCT OF FRACTIONS 



SUFFER 3 



NORMALIZE AND ROUND 



I III I 



STAGE 1 



STAGE 2 



STAGE 3 



(FM) 



Ml 



Al MI DPX OPY 



Figure 2-4 Floating Multiplier 
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2.6 DATA PAD UNIT 

Data pad, illustrated in Figure 2-5, consists of two fast accumulator 
blocks (each with 32 floating-point locations) called data pad X (DPX) 
and data pad Y (DPY). In a single-machine cycle, the contents of one 
location from each data pad can be read out and used. In addition, 
data can also be stored into one location in each data pad in the same 
cycle. For example, in a single instruction (167ns), a multiply can be 
initiated specifying one argument from DPX and another from DPY; an 
adder result (FA) can be stored into a DPX location, and a data element 
in main data stored into a DPY location. On the very next instruction, 
similar multiple data pad accessing could be accomplished again. 

The two memories are addressed via a combination of the data pad 
address (DPA) register and four index field values contained in a given 
instruction word. DPA can be thought of as a base address register or 
stack pointer. It can be loaded from the s-pad (SPFN) or its contents 
can be incremented or decremented by one. 

For a given read or write operation (i.e., reading from data pad X) an 
index value contained in the instruction is added to the current 
contents of DPA to give the effective address for that particular 
operation. The four index fields (one each for read DPX, read DPY, 
write DPX, and write DPY) are each three bits wide and have a range 
from -4 to +3 relative to DPA. 

Data from either data pad can be used by the multiplier (Ml, M2), adder 
(Al, A2), or memory input (MI). Data can be stored into data pad from 
the adder (FA), multiplier (FM) , s-pad function output (SPFN), command 
buffer value (VALUE), or from data pad (DP). 



FPS 860-7259-003 2-11 



:mbs value opx dpy md spfn ;m 



(DATA PAD 3US * OB) 

v 



FA FM 



dRITE INDEX 
READ INDEX 




(DPX) 



DPA 



FA FM 




WRITE INDEX 
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Figure 2-5 Data Pad 
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2.7 DATA MEMORY UNIT 

The data memory unit, as illustrated in Figure 2-6, is the primary data 
store for the AP. It is available in 38-bit wide 8K modules which have 
an interleaved cycle time of 333ns (for the standard memory) and 167ns 
(for the fast memory) . 

The memory unit contains a main data memory (MD) buffer and a memory 
input (MI) buffer. Data read from memory is placed by the controller 
into MD, while data is written into memory from the MI. The memory 
address (MA) register points to the desired memory location. 

In referencing memory for read or write operations, the selected 
operation is initiated by making a change to the memory address (MA) 
register. The MA register can be loaded from the s-pad (SPFN) or its 
contents incremented or decremented by one. 

A write operation is specified by loading MI with the data to be 
written during the same instruction in which MA is changed. This data 
is then written into memory from MI during the next two AP cycles. 
Data can be loaded into MI from the floating adder (FA), floating 
multiplier (FM), data pad (DP) , main data memory (MD), table memory 
(TM), the input bus (INBS), s-pad function (SPFN), or the command 
buffer value (VALUE). A memory operation can be initiated every other 
cycle. The intervening cycle can be used for any other AP function 
except another memory initiate. 

When a memory read is initiated, the requested memory data is placed by 
the memory controller into the main data memory (MD) register three 
cycles after the reqest is made. Two instructions after the read 
request, another memory operation can be initiated. Again, the 
intervening cycle can be used for any non-memory function. Data in MD 
can be used by the floating adder (A2), floating multiplier (M2), or 
data pad (DP). 

To optimize the operation of the AP, it is necessary for the programmer 
to look ahead and initiate memory reads prior to the actual time that 
arguments from data memory are used in a calculation. 

The system provides a memory lock-out which serves to ensure that 
erroneous reads and writes of memory do not occur. If a memory 
initiate occurs while memory is busy, further program execution is 
halted until the previous memory cycle is completed. 
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Figure 2-6 Data Memory Unit 
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2.8 TABLE MEMORY UNIT 

The repeated use of standard constants (such as complex roots of unity 
and transcendental values) in signal processing routines dictates their 
ready availability to the programmer.. A separate table memory, as 
illustrated in Figure 2-7, eliminates memory accessing conflicts by 
allowing data values (constants) to be placed in separate memory banks. 

Values read from table memory are placed by the controller into the 
table memory buffer register. The table memory address (TMA) register 
serves as a pointer to the desired location. 

A table memory read is initiated by changing the contents of TMA either 
by loading a value from the s-pad (SPFN) or by incrementing or 
decrementing the contents of TMA. 

A new table value may be requested every machine cycle. This value is 
available for use two cycles later. The value can be used by the 
floating adder (Al), floating multiplier (Ml), or data pad (DP). 

In FFT mode (i.e., when FFT is being computed), the address in TMA is 
interpreted by the hardware to be an angle which points to the 
appropriate root of unity for a particular step in the algorithm. This 
allows the full table of roots of unity to be compressed into a single 
quadrant of cosines. 

Refer to Programmer's Reference Manual Part One (FPS 860-7319-000) for 
information on TMRAM. 
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Figure 2-7 Table Memory 
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2.9 INTERNAL FLOATING-POINT FORMAT 

Floating-point data internal to the AP is represented as follows; 



EXPONENT 


MANTISSA 



i 

E2 



9 10 
E9 m 



M27 



3994 



where: 

mantissa 28-bit 2's complement fraction 
exponent 10-bit binary exponent, biased by 512 

The value of a floating-point number in this format is defined as: 

mantissa * 2 (exponent -512) 



The dynamic range of this format is from 0.5 * 2~ 512 to (l-2~ 28 )*2 511 . 
or from 3.7*10~^ 55 to 6.7*10 153 . 

The 28-bit fraction, combined with the convergent rounding algorithm 
used in the floating adder and multiplier, gives a maximum relative 
error of 7.5*10 - 9 per arithmetic operation. This is a precision of 8.1 
decimal digits. As a comparison, unrounded IBM 360 format gives only 
6.0 decimal digits of arithmetic accuracy. 

The convergent rounding hardware rounds up when the magnitude of the 
remainder is greater than one-half of the least significant bit of the 
mantissa. This serves to minimize truncation errors in long series of 
arithmetic calculations. 

Format conversion between host format and AP format occurs in the 
interface and in the floating adder unit. The dynamic range of the 
internal format is large enough to accommodate IBM 360 format and other 
host formats. The extended precision of the AP internal format ensures 
that accuracy is maintained during critical stages of data analysis. 
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CHAPTER 3 



PROGRAMMING CONSIDERATIONS 



3.1 INTRODUCTION 

This chapter provides an introduction to programming the AP. The 
principal operations which control each of the six functional units are 
described below. A complete listing of the AP instruction word fields 
can be found in Appendix B. 

In the coding examples, a semi-colon (;) is used to separate 
operations within a complete instruction word. A comma (,) separates 
operands. A quote mark (") is used to denote a comment. A less than 
sign (<) is used to mean "•« — " (replaced by) where the operation 
involved is a data transfer. 



3.2 FLOATING-POINT ADDER 

The following sections describe the floating-point adder. 



FPS 860-7259-003 3 - 



3.2.1 FLOATING ADDER OPERATIONS 

Floating adder operations are initiated by the following instructions: 



instruction 


operands 


FADD 


A1.A2 


FSUB 


A1.A2 


FSUBR 


A1.A2 


FAND 


A1,A2 


FOR 


A1,A2 


FEQV 


A1,A2 


FABS 


A2 


FIX 


A2 



FSM2C 



F2CSM 



FSCALE 



A2 



A2 



A2 



operations initiated 

A1+A2 

A1-A2 

A2-A1 

Al AND A2 

Al OR A2 

Al EQV A2 

ABS(A2) 

Convert A2, floating-point 

number to fixed integer. 

Convert A2, signed magnitude 
to 2's complement. 

Convert A2, 2's complement to 
signed magnitude. 

Scale A2. 



where Al and A2 are any of the following data sources: 



Al; 



A2: 



FM 

DPX 

DPY 

TM 

ZERO 

FA 

DPX 

DPY 

MD 

ZERO 



floating multiplier result 

data pad X accumulator 

data pad Y accumulator 

last data read from table memory 

floating-point zero 

floating adder result 



last data read from data memory 
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Any data source listed under Al may be combined with any data source 
listed under A2. For example, to add a number from data pad X to 
another from data pad Y: 



FADD DPX, DPY "DPX+DPY 

To subtract a number read out of data memory from a constant in table 
memory: 

FSUB TM.MD "TM-MD 

A reverse subtract changes the order of the subtraction; i.e., 
FSUBR TM,MD "MD-TM 

subtracts a constant from table memory from a number in data memory. 

To negate a number from DPX: 

FSUB ZERO, DPX "0.0 - DPX = -DPX 



To take the absolute value of a number from data memory: 
FABS MD "ABS(MD) 



To fix (convert from floating-point to integer format) a number from 
DPY: 



FIX DPY "FIX (DPY) 



FPS 860-7259-003 3-3 



3.2.2 ADDER PIPELINE 

The floating adder is a two-stage pipeline. A FADD instruction loads 
the designated operands into the Al and A2 registers. The previous 
contents of Al and A2 are pushed down the pipeline to the buffer 
register. One AP cycle later, the new contents of the buffer have been 
normalized and rounded and are then available for use or storage 
elsewhere. 

Example 1 illustrates how the adder pipeline works, where A,B,...,G,H 
are floating-point numbers to be added. 



Example 1 



! 


CYCLE 


INSTRUCTION 


ADDER PIPELINE 


ADDER 

RESULT (FA) 


TIME 


Ul.AZ 


BUFFER 1 





1. 


FADD A,B 


A, 8 


- 


- 


167ns 


2. 


FADO CD 


CD 


A,B 


-- 


333ns 


3. 


FADO E,F 


E,F 


CD 


A+B 


500ns 


4. 


FADD S,H 


G,H 


E,F 


C+D 


667ns 


5. 


FADD 


-- 


G,H 


E+F 


833ns 


6. 


- 


— 


G,H 


G+H 



0995 
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The FADD without arguments in cycle 5 is used only to push the last 
computation into the buffer register and hence to the end of the 
pipeline. Thus, it is a dummy add because the results are unimportant 
and are never used. In Example 1, the floating-point adds are 
completed in one microsecond. During cycles 2 through 4, when the 
pipeline is full, adds are done every 167ns, the maximum rate. The 
completed results as they come out of the adder pipeline are referred 
to by the mnemonic FA. FA is dynamic in the sense that it must be used 
or stored elsewhere before being changed by the next floating adder 
instruction. The programmer, however, has complete control over the 
pipeline. Arguments advance only when pushed through the pipeline by 
floating adder instructions. 



3.2.3 AN EXAMPLE 

A complete computational sequence to do the vector sum &± m A.±*-B± f 

i -0,1, 2, 3, is shown in Example 2. A^ is stored in data pad X locations 

0-3, and B^ is stored in data pad Y location through 3. 



Example 2 



1. FADO DPX(0),DPY(O) "Do A,+B 

2. FADD 0PX(1),DPY(1) "Do A,+ 8 t 

3. FADD DPX(2),DPY(2); DPX(0)<FA "Do A 2 +B 2 , A +B is now 

done, save it in A<j 

4. FADD DPX(3),DPY(3); DPX(1)<FA "Do A 3 +B 3 , Ai+Bi is now 

done, save it in k\ 

5. FADD; DPX(2)<FA "Push Adder; save A 2 +B 2 in A 2 

6. DPX(3)<FA "Save A 3 + B 3 in A 3 
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Example 3 is a chart of this computation showing the state of the adder 
pipeline and data pad after each instruction is executed. 



Example 3 





ADDER PINPELINE 


ADDER 
RESULT 


DATA PAj X 


CYCLE 


Ia1,A2 


BUFFER i 


1 

1 


1 


2 


3 i 


1. 


Ao»B 


— 


- -- 


A o 


A, 


A , 


A 3 


2. 


Ax ,Bt 


A ,B 


- 


A, 


«l 


A 2 


A 3 


3. 


A<) ,00 


AlBj 


Aq+B, 


A +B 


A l 


A 2 


A, 


4. 


ns »°3 


A 2 ,B 2 


Ai+Bj 


Ao f B, 


A,+B, 


A, 


A3 


5. 


-- 


A 3 ,B 3 


A 2 +B 2 


A +B 


Aj+B, 


A 2 +B 2 


A 3 


6. 


-- 


A3.B3 


A 3 +B 3 


Aq+B 


Aj+Bv 


A 2 +B 2 


A 3 +3 3 



0997 



3.2.4 FLOATING ADDER TESTS 

Table 3-1 lists the conditional branches that test the floating adder 
result (FA): 



Table 3-1 Floating Adder Tests 



3R LOOP 

3FEQ LOOP 
BFNE LOOP 
3FGE LOOP 
BFGT LOOP 



"Branch unconditionally to program 

"location "LOOP" 

"Branch 1f FA-0.0 

"Branch if FA*Q.O 

"Branch if FA>0.0 

"8ranch if FA>0.0 

1059 
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The branches test FA one instruction cycle after it is ready for use. 
That is, an adder result may be tested one cycle after it comes out of 
the adder pipeline. This is shown in Example 4. 



Example 4 



1. FSUB dpx.dpv 

2. FADD 

3. DPX<FA 

4. BFEQ LOOP 



'Do a computation 

"Push the result out 

"Save the result 

"Test the result here (branch to 
" location "LOOP" if result was 
" zero) 

0998 



Compound tests may also be made. Test MD to see if it is between a 
lower limit contained in DPX (1) and an upper limit in DPX (2) (i.e., 
see if DPX(1)<MD<DPX(2)). This is shown in Example 5. 



Example 5 



1. FSUBR DPX(2),MD 

2. FSUB DPX(1),MD 

3. FAOD 

4. 8FGT BIG 

5. BFGT SMALL 

6. . . . 



"Do MD-DPX(2) 

"Do DPX(1)-MD 

"Push first test result out 

"Was too big 

"Was too small 

"OK 

0999 



The branches are made relative to the current program source address 
(PSA) with a 5-bit displacement value. This means that the conditional 
branch target address must be within -20g to +178 locations of the 
current instruction. 
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3.2.5 FLOATING-POINT LOGICAL OPERATIONS 

Instructions FAND, FOR, and FEQV perform logical operations on 
floating-point numbers. Exponent alignment occurs as for a normal 
floating-point add. The two mantissas are then combined using the 
specified logical operation. The result is then normalized and 
rounded. 



3.3 FLOATING-POINT MULTIPLIER 

The following sections describe the floating-point multiplier. 



3.3.1 MULTIPLY INSTRUCTION 

Floating-point multiplies are initiated by the following instruction: 

FMUL M1,M2 

which initiates a multiply between Ml and M2, where Ml and M2 are any 
of the following data sources: 



Ml FM floating multiplier result 

DPX data pad X accumulator 

DPY data pad Y accumulator 

TM last data read from table memory 



M2 FA floating adder result 
DPX 
DPY 
MD last data read from data memory 



Thus, any of the data sources listed under Ml can be multiplied by any 
of the data sources in M2. For example, to multiply a number read from 
data memory by a constant from table memory: 

FMUL TM,MD "TM * MD 

or, to multiply a number in data pad X by another number in data pad Y: 

FMUL DPX, DPY "DPX * DPY 
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3.3.2 MULTIPLIER PIPELINE 

The floating multiplier is a three-stage pipeline. An FMUL instruction 
loads the specified operands into the Ml and M2 registers. The two 
previous partially-completed products are pushed down the pipeline to 
buffer 2 and buffer 3, respectively. One AP cycle later, the new 
contents of buffer 3 have been normalized and rounded and are then 
available for use or storage elsewhere. 

The instruction sequence shown in Example 6 illustrates how the 
multiplier pipeline works where A,B,...,G,H are floating-point numbers 
to be multiplied together. 



Example 6 





CYCLE 


INSTRUCTION 


MULTIPLIER PIPELINE 




TIME 


| M1.M2 


BUFFER 2 BUFFER 3| 


MULTIPLIER 
RESULT (FM) 





1. 


FMUL A.B 


A,B 


~ 


- 


-- 


167ns 


2. 


FMUL CO 


CD 


A,B 


- 


-- 


333ns 


3. 


FMUL E,F 


E,F 


CO 


A.B 


-- 


500ns 


4. 


FMUL G,H 


G,H 


E,F 


CD 


A*9 


667ns 


5. 


FMUL 


— 


G,H 


E,F 


C*0 


833ns 


6. 


FMUL 


— 


- 


G,H 


E*F 


l.Ous 


7. 


-- 


— 


— 


G.H 


G*H 



1000 



The FMUL in cycles 5 and 6 are dummy multiplies used to push the last 
two computations to the end of the pipeline. In Example 6, four 
floating-point multiplies in l.Ous are completed. During cycles 3 and 
4, while the pipeline is full, products are done every 167ns, the 
maximum rate. 

The completed products as they come out of the multiplier pipeline are 
referred to by the mnemonic FM. FM is dynamic in that it must be used 
or stored before being changed by the next FMUL instruction. 



FPS 860-7259-003 



3 - 



3.3.3 AN EXAMPLE 

A computation example to square the elements in a vector is shown in 
Example 7. 



Example 7 



A. » A.*A r 1*0,1,2,3. A i is stored in Oata Pad X. 

2 

1. FMUL DPX(0),DPSX(0) "Do A 



2. FMUL DPX(l),0PX(i; "Do Ai 

2 

3. FMUL DPX(2).DPX(2) "Do A 2 

4. FMUL DPX(3),DPX(3); DPX(0)<FM "Do A 3 , save Aq 



5. FMUL: DPX(1)<FM "Save A! 



6. FMUL: DPX(2)<FM "Save A 2 

7. DPX(3)<FM "Save A 3 



1001 
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Example 8 illustrates this computation showing the state of the 
multiplier pipeline and data pad X after each instruction is executed. 



Example 8 





MULTIPLIER PIPELINE 


MULTIPLIER 
RESULT (FM) 


DATA PAD X 


CYCLE 


IM1.M2 SUFFER 2 BUFFER 3 1 


1 1 2 31 


1. 


Ao .Ao 


« 


— 


— 


A, 


Ai 


A2 


A3 


2. 


Ai.Ai 


A C ,A 


-- 


-- 


Ao 


Ai 


A 2 


A3 


3. 


A 2 ,A 2 


Ai.Ai 


A ,Ao 


— 


Ao 


Ai 


A 2 


A3 


4. 


A3.A3 


A2.A2 


Ai.Ai 


A 2 


A 2 



A^ 


A 
2 


A^ 


5. 


-- 


A3 .A3 


A 2 ,A 2 


2 
Ai 


Ao 


Ai 


A 2 


A3 


6. 


— 


-- 


A3 .A3 


A 2 


Ao 


Ai 


2 

A, 


A3 


7. 


-- 




A 3 , A 3 


2 
A3 


Ao 


7 

Ai 




A 2 


2 
A3 



1002 
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3.3.4 MULTIPLY-ADDS 

The full floating-point computational power of the AP is utilized when 
a process involving both multiplies and adds is considered. The dot 
product of two eight-element vectors A • B. = I A. B., i = -4, -3,...,1, 
2, 3, where A ± is in Data Pad X and B. is in Data Pad Y, is formed in 
Example 9. 



Example 9 



Fill the 1. FMUL DPX(-4) ,DPY(-4) 

Multiplier^ 2. FMUL DPX(-3) ,DPY(-3) 

Pipeline i, 3. FMUL DPX(.-2) ,DPY(-2) 

' 4. FMUL DPX(-l) ,DPY(-1) ; 

FADD FM.ZERO 



Fill the 

Adder 

Pipeline 



I 



5. FMUL 0PX(0),DPY(0); 
FACD FM.ZERO 

6. FMUL DPX(1),DPY(1); 
FADD FM.FA 



Both 

Pipelines ^ 7. FMUL 0PX(2) ,DPY(2) ,DPY(2) ; 

full FADD FM.FA 



8. FMUL DPX(3),DPY(3); 
V FADD FM.FA 



Empty the 
Multiplier ^ 
Pipeline 



9. FMUL; FADD FM.FA 



10. FMUL; FADD FM,FA 



11. FADD FM,FA 



12. FADD; DPX(3)<FA 

Empty 

the Adder < 13. FADD DPX(3),FA 

Pipeline 



14. FADD 

15. DPX(3)<FA 



A-l3-u is 
save it in 
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"Do A-'.B-._ 

"Do A-3B-3 

"Do A-jB-; 

"Do A-iB- ; . 

" now done, 

" adder. 

"Do A B . A-3B-3 is now 

" done, save it in the 

" adder. 

"Do AjBi. A- 2 B- 2 is now 

" coming out of the multiplier, 

" and A-uB-iifrom the adder, add 

" them together. 

"Do A 2 B 2 . A-1B-1 is now coming 

" out of the multiplier, and 

" A-3B-3 from the adder, add 

" them together. 

"Do A3B3. A Bo is now coming 

" out of the multiplier, and 

" (A-kB-i» + A- 2 B- : ) from the 

" adder, add them together. 

" AjBi is coming out of the 

" multiplier, and (A-3B-3 

" + A- 1 B- 1 ) from the adder, 

" add them together. 

"A 2 B 2 is coming out of the 

" multiplier, and (A-t,B-- 

" +A-2B--+A0B0) from the 

" adder, add them together. 

"A 3 8 3 is coming out of the 

" multiplier, and 

" (A- 3 B- 3 +A- 1 S-i+A 1 B 1 ) 

" from the adder, add 

" them together. 

"(A-n B-i, +A- 2 B-- *-Ao B +A 2 83 ) 

" is coming out of the 

" adder, save it in DPX(3). 

" ( A- 3 B- 3 +A-i B-i +Aj 8j +A3 B 3 ) 

" is coming out of the 

" adder, add it to 

" (A-uB%+A- : -B-2+AoB +A I 8 2 ) 

" which was saved in DPX(3). 

"Push result out of Adder 

"The result: (A- u B-^+ 

" A- 3 B-3+A- 2 B-2+A- 1 B- 1 + 

" A B +A 1 B 1 +A 2 8 2 +A 3 B3), 

" Saved in DPX(3). 

1003 



In accumulating the sum-of -products, the even term sum is kept in 
one-half of the adder pipeline and the odd term sum in the other half. 
During cycles 5 through 7 when both pipelines are full, floating-point 
multiply adds are computed every 167ns. This is 12 million 
floating-point computations per second. A longer sum of products 
calculation involving more terms would maintain this maximum 
computation rate, because nearly all of the time was spent filling and 
emptying pipelines. Even so, the seven adds and eight multiplies take 
15 cycles (2.5us) to complete (an overall rate of 333ns per 
floating-point multiply add). 

Example 10 summarizes the computation as a further aid in understanding 
the multiply add interaction in the sum-of-products computation of 
Example 9. 
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Example 10 



NOTE 





MULTIPLIER 


ADDER: 




CYCLE 


i Ml, M2 


FM 1 


1 Al, A2 


FA i 


DATA PAD: 
3 


1. 


^ ■» i+ j B -^ 


... 


— 


... 


... 


2. 


A-3 .8-3 





— 


... 


... 


3. 


A. 2 ,8. 2 


... 


— 


— 


— 


4. 


A-i .B.J 


A-, *3-„ 


A-,8.,,,0.0 





— 


5. 


A q,8 q 


A-3*B. 3 


AjBj ,0.0 


... 


— 


6. 


Aj,8j 


A- 2 *B. 2 


M- 2 ' A -, B -, 


A B 

"- ~k 


... 


7. 


A 2 ,B £ 


A.,**., 


A -i B -.' A -3 B - 3 


A B 

"3 "3 


... 


3. 


A3.3 3 


A *A 


A„B , ES 2 


ES 2 


— 


9. 


— 


Ai*A, 


A^l OSo 


OS 2 


... 


It). 


— 


A,*A 2 


A 2 B 2 , ES 3 


ES 3 


... 


11. 


... 


A,*A, 


A383, 0S 3 


OS 3 


— 


12. 


— 


— 


— 


ES„ 


ESu 


13. 


— 


... 


0S U , ES„ 


os u 


ES U 


14. 


— 


... 


— 


... 


E% 


15. 


— 


... 


— 


OS k +ES u 


OS^+ES., 



ES is n terms of the even term Sum: A.B. ,i = -4, -2, 0, 2 
OS is n terms of the odd term Sum: A.B J( i = -3, -1 1 3 



1004 
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3.4 DATA PAD 



The following sections describe the data pad. 



3.4.1 DATA PAD ADDRESSING 

Data pad is a block of 64 high-speed accumulators used to store 
intermediate results during a computation. In any given AP 
instruction, the programmer has 16 of the data pad accumulators to work 
with; eight in data pad X and eight in data pad Y. They are addressed 
relative to the current value of the data pad address register which 
functions as a base register for data pad. For example, if DPA has a 
value of 24g, locations 208 through 278 would be available for use. 
This is illustrated in Figure 3-1. 



DPX 



DPY 




DPA 



-3 




AVAILABLE FOR USE 
WHEN DPA = 24„ 



1005 



Figure 3-1 Data Pad Address 



A displacement value from -4 to +3 may be specified when using DPX and 
DPY (i.e., if DPA=24 g ): 



DPX (3) 
DPY (-4) 
DPX(O) 
DPY 



means DPX location 24+3=27 

means DPY location 24-4*20 

means DPX location 24+0=24 

means DPY location 24+0-24 
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Four separate displacements are provided, one each for reading and 
writing DPX and DPY. Thus, four separate locations in data pad may be 
used in a given instruction. With DPA=24s, the following instruction 
occurs in one cycle: 



FADD DPX(3),MD; 
(read DPX) 



FMUL TM,DPY(-2); DPX(-3)<FA; 
(read DPY) (write DPX) 



DPY(1)<FM 
(write DPY) 



This would add DPX location 27 to the last data read from data memory, 
multiply the last data read from table memory by the contents of DPY 
location 22, store the results of a previous add into DPX location 21, 
and store the results of a previous multiply into DPY location 25. 

All 64 locations of data pad are accessed by changing the DPA pointer: 



INCDPA 
DECDPA 
SETDPA 



"Increments DPA by 1 

"Decrements DPA by 1 

"Loads DPA with the current S-PAD 

"function (SPFN, refer to section 3.7) 



Changes in DPA take effect the next instruction after they occur (i.e., 
if DPA=24): 



FADD DPX(0),DPY(0); INCDPA 



FADD DPX (0), DPY (0); 



FADD DPX(0),DPY(0) 



INCDPA 



"DPA is still 24 so 
"DPX 2 4 is added to 
"DPY 2 4 

"Now DPA=25, so 

"DPX 2 5 is added to DPY 2 5 

"Now DPA=26, so 
"DPX 2 6 is added to 
"DPY 2 6 



Thus, by successively incrementing DPA, the data pad can be used as a 
queue; or by properly incrementing and decrementing DPA, the data pad 
can be used as a stack. Data pad address is circular. That is, with 
successive increments of DPA the next location after 37s is 0; with 
successive decrements of DPA the next location after is 37s • 
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3.4.2 WRITING INTO DATA PAD 

Data may be stored into DPX and DPY from FA, FM, or DB (the data pad 
bus) . 



and 



DPX<FA 
DPX<FM 
DPX<DB 

DPY<FA 
DPY<FM 
DPY<DB 



"Store adder result into DPX 
"Store multiplier result into DPX 
"Store data pad bus into DPX 

"Store into DPY 



The following may be selected onto the data pad bus (DB); 



DB=ZERO "Floating-point zero 

DB=INBS "Input Bus 

DB-VALUE "A 16-bit immediate value 

DB=DPX "DPX 

DB=DPY "DPY 

DB=MD "Last data read from data memory 

DB=SPFN "S-pad function (16-bit integer) 

DB-TM "Last data read from table memory 



Thus, if DPA=24s, the following instruction is possible: 



DPX(3)<FA; DPY(-2)<DB; DB-MD 



This stores the current adder result into DPX location 27 and stores 
the last data read from the main data memory into DPY location 22 via 
the data pad bus . 
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3.4.3 DATA PAD BUS 

Data to be stored into DPX and DPY can be moved through three pathways: 
FM, FA, and DB. While FM and FA are fixed in meaning (output from the 
floating multiplier and adder, respectively), the data pad bus (DB) 
pathway can be connected to any one of eight possibilities depending 
upon the programmer's choice. 

Examples : 

• MD is put into both DPX and DPY: 
DPX<DB; DPY<DB; DB=MD 



MD is put onto the data pad bus , and 

the data pad bus is stored into DPX and DPY. 



• MD is put into DPX and TM into DPY: 
DPX<DB; DB-MD; DPY<DB; DB=TM 



This is an error. Only one choice at a time 
can be made for the data pad bus . This 
double transfer would take two separate 
instructions to accomplish. 



• FA is stored into DPX and MD into DPY: 



DPX<FA; DPY<DB; DB=MD 



MD is put onto the data pad bus in order to get 
it into DPY. FA goes directly into DPX. 
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To simplify notation, data transfers invloving data pad bus can be 
written in a shorthand manner. 



shorthand longhand 

DPX<MD; DPY<MD DPX<DB; DPY<DB; DB=MD 



DPX<MD; DPY<TM DPX,DB; DB=MD; DPY,DB; DB=TM 

(still an error no matter how it is written) 



DPX<FA; DPY<MD DPX<FA; DPY<DB; DB=MD 



In the shorthand notation, choices for the data pad bus are not 
explicitly indicated. Transfers are written as if there were a direct 
connection between the source and destination while in fact it is the 
data pad bus which does the connecting. Remember, however, that the 
programmer is still making a data pad bus choice and only one choice is 
allowed per instruction. Errors like the one shown above (where two 
data pad bus choices are attempted) are detected and flagged by the 
assembler. 
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3.5 DATA MEMORY 

The following sections describe data memory. 



3.5.1 MEMORY ADDRESSING 

Main data memory cycles are initiated by changing the memory address 
register which points the memory location to be read from or written 
into: 



INCMA "Increment MA by 1 
DECMA "Decrement MA by 1 
SETMA "function (SPFN, refer to section 3.7) 



All of the above initiate a memory cycle at the address pointed at by 
the new contents of MA. If a memory input (MI) field is also included 
in the instruction, then the memory cycle is a write cycle. Otherwise, 
a read cycle is initiated. When sequential memory locations are 
accessed, a new memory cycle may be initiated by every other AP 
instruction. 
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3.5.2 DATA MEMORY READS 

Data read from memory is available for use three instruction cycles 
after the read is initiated. The instruction sequence shown in Example 
11 illustrates how memory data is accessed: A, B, and C are 
floating-point numbers in memory locations 101, 102, and 103, 
respectively. It is assumed that MA is set to 100 before starting. 



Example 11 



TIME 


AP 
CYCLE 


INSTRUCTION 


MEMORY 
ADDRESS {.MA! 


MEMORY OATA 
RESULT (MD) 





i 


INCMA 


101 


... 


167ns 


2. 


— 


101 


... 


333ns 


3. 


INCMA 


102 


— 


500ns 


4. 


— 


102 


A 


657ns 


5. 


INCMA 


103 


A 


833ns 


6. 


— 


103 


8 


l.Ous 


7. 


— 


103 


B 


1.17us 


a. 


... 


103 


C 



1006 
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Three AP cycles after a given memory location is read, data from that 
location is ready in the memory data register and available for use. 
MD may be used by the adder or the multiplier as follows: 



FADD DPX(3),MD; FMUL DPY(-2),MD "Do MD+DPX and MD * DPY 



It can also be placed on the data pad bus and stored in data pad or 
back into memory as follows: 



DPX(2)<MD 



store MD into DPX. 



3.5.3 AN EXAMPLE 

Example 12 loads a vector A ±t i=0,l,2 stored in memory locations 101, 
102, and 103 into DPX locations 10, 11, and 12. It is assumed that MA 
is set to 100 and DPA is set to 10 before starting. 



Example 12 



1. 


INCMA 




2. 


-- 




3. 


INCMA 




4. 


DPX<MD; 


INCDPA 


5. 


INCMA; 




5. 


DPX<MD; 


INCDPA 


7. 


-- 




3. 


DPX<MD 





"Fetch Ao from memory 



"Fetch Ai from memory 

"Store A into DPX location 10 
" and bump DPA pointer to 11 

"Fetch A, from memory 

"Store A,, into DPX location 11 
" and bump DPA pointer to 12 



"Store A 2 into DPX location 12 



1007 
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Example 13 illustrates the transfer of Example 12 showing the state of 
each component after each instruction. 



Example 13 





MEMORY 




DATA 


PAD 




CYCLE 


IMA 


mdI 


1 QPA 


opx 10 


DPX n 


DPX 12 1 


1 . 


101 


... 


„ 


— 


— 


— 


2. 


101 


— 


10 


— 


— 


— 


3. 


102 


— 


10 


— 


— 


— 


4. 


102 


Aq 


10 


Aq 


— 


— 


5. 


103 


*» 


11 


A, 


— 


— 


6. 


103 


A i 


11 


Ao 


Ai 


... 


7. 


103 


Al 


12 


Aq 


A! 


— 


8. 


103 


A- 


12 


Ao 


Ai 


», 
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3.5.4 DATA MEMORY WRITES 

Data memory write cycles are indicated by the following: 



MKFA 
MKFM 
MI<DB 



"write the adder result into memory 
"write the multiplier result into memory 
"write data pad bus into memory 



These instructions load data into the memory input buffer register from 
where it is written into memory. Data may be written into sequential 
memory locations by every other AP instruction. 
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3.5.5 AN EXAMPLE 

Example 14 squares the elements of a vector A^, i=0,l,2, in DPX 
locations 10, 11, and 12 and stores the results into data memory 
locations 101, 102, and 103. It is assumed that MA is set to 100 and 
DPA is set to 10 before starting. 



Example 14 



1. FMUL OPX.DPX: INCDPA 

2. FMUL 

3. FMUL OPX.DPX: INCDPA 

4. FMUL: MI<FM: INCMA 

5. FMUL DPX, DPX 

6. FMUL: MI<FM: INCMA 

7 . FMUL 

3. MI<FM; INCMA 



Square Ao, bump DPA pointer 
to 11 

Push down the multiplier 
pipeline 

Square Aj , bump DPA pointer 
to 12 

Write AJ into memory location 
101 

Square A 2 

2 

Write A! into memory location 102 
Dummy FMUL to empty pipeline 
Write A 2 into memory location 103 
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Example 15 illustrates the sequential data memory write computation. 



Example 15 





DPA 


MULTIPLIER 


MEMORY 


CYCLE 


1 Ml ,M2 FmI 


1 MA Mil 


1. 


10 


Aq.Aq 


— 


-- 


— 


2. 


11 


— 


-- 


— 


-- 


3. 


11 


Ai.Ai 


-- 


-- 


-- 


4'. 


12 


— 


2 


101 


n 

Ao 


5. 


12 


A 2 ,A, 


- 


101 


2 

Ao 


6. 


12 


— 


A? 


102 


2 

Ai 


7. 


12 


... 


-- 


102 


2 

Ai 


8. 


12 


— 


4 


103 


2 

A 2 



1Q1Q 
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3.5.6 MEMORY INTERLEAVE 

Data memory is divided into 16 banks of 4K words each using MA00-MA02 
and MA15 as a memory bank select. (These are the three highest-order 
bits and the least-significant bit of MA.) Memory references to 
different banks may be made every two AP cycles, while references to 
the same bank may be made every three AP cycles. For some possible 
memory addressing sequences refer to Table 3-2. 



Table 3-2 Memory Interleave Sequence 



MEMORY 


ADDRESS SEQUENCE (OCTAL) 


MEMORY BANK SEQUENCE 


MEMORY REFERENCE TIMING 


101, 


102, 103, 104, ... 


1, 0, 1, 0, ... 


every 2 AP cycles 


166, 


165, 164, 163, ... ■ 


0, 1, 0, 1, ... 


every 2 AP cycles 


100, 


102, 104, 106, ... 


0, 0, 0, 0, ... 


every 3 AP cycles 


233, 


10374, 234, 10376, ... 


1, 2, 0, 2, ... 


every 2 AP cycles 



101 1 



Thus , references to successive sequential memory locations can be made 
every other AP cycle, but references to successive-odd or 
successive-even locations must be three cycles apart. 
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3.5.7 MEMORY LOCKOUT 

If memory references are made too rapidly for memory to handle, the CPU 
suspends program execution and spins until the memory is no longer 
busy. Thus, suppose the following were coded: 



1. INCMA "referencing memory every cycle 

2. INCMA 

3. INCMA 



The following execution is the result: 



Ons 1. INCMA 

167ns 2. INCMA 
333ns "SPIN" 

500ns 3. INCMA 
667ns "SPIN" 



The processor waits an extra cycle after instructions 2 and 3 because 
memory is still busy from the previous memory references. This 
arrangement is fine if there is no useful computing to do during the 
spin cycles. Otherwise, it is better to space out the INCMAs and to do 
something useful during the cycle between memory references. 



FPS 860-7259-003 3-26 



3.6 TABLE MEMORY 



The following sections describe table memory. 



3.6.1 TABLE MEMORY ADDRESSING 

Constants stored in table memory are read by setting the table memory 
address (TMA) register to the address of the desired table memory 
location. This is done with the following instructions: 



INCTMA 
DECTMA 
SETTMA 



"increments TMA by 1 

"decrements TMA by 1 

"set TMA to the current s-pad 
"function (SPFN) 



Each of the above initiates a fetch from the table memory location 
pointed at by the new contents of TMA. Two AP cycles later, the 
contents of the desired locations are available for use. A new 
location can be fetched every AP cycle. The sequence in Example 16 
illustrates how table memory is accessed. KO, Kl, and K2 are constants 
stored in table memory location 235, 236, and 237. It is assumed that 
TMA is set to 234 before starting. 



Example 16 



TIME 


AP 
CYCLE 


INSTRUCTION 


TABLE MEMORY 
ADDRESS (TMA) 


TABLE MEMORY 
RESULT (TM) 





1. 


INCTMA 


235 


— 


167ns 


2. 


INCTMA 


236 


— 


333ns 


3. 


INCTMA 


237 


KO 


500ns 


4. 


— 


237 


Kl 


667ns 


5. 


— 


237 


K2 
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Two cycles after a given table memory location is fetched, the data is 
ready in the table memory data register and is available for use. TM 
can be used by the adder or the multiplier: 



FADD TM,DPX(2);FMUL TM,DPY(-3) "do TM+DPX and TM*DPY 
or put on the data pad bus and stored into data pad: 

DFX(-1)<TM "store TM into DPX 



3.6.2 AN EXAMPLE 

Example 17 forms the vector sum A ± = Bjl-HOl, i-0,1,2, where A ± is in DPX 
locations 10-12, B ± is in DPY 10-12, and Ki is a series of constants 
stored in table memory location 235-237. A ± ± s stored back into DPX. 
It is assumed that DPA is set to 10 and TMA is set to 234 before 
starting. 
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Example 17 



1. INCTMA 

2. INCTMA 

3. INCTMA; FADO TM,DPY; INCOPA 

4. FADD TM.DPY; INCDPA 

5. FADD TM,DPX (.0); DPX(.-2]<FA 

6. FADD: DPS(-1)<FA 

7. DPX(0)<FA 



"Fetch K 

"Fetch '<! 

"Do K + B , bump DPA to 11 

"Do <! + B ls bump DPA to 12 

"Do K 2 + B 2 , store A in DPX !0 

"Store Aj in DPX U 

"Store A 2 in DPX 12 



Example 18 illustrates the computations of Example 17. 



Example 18 
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TABLE MEMORY 


ADDER 


DATA PAD X 


CYCLE 


Itma tmI 


1 A1.A2 FA ! 


1 DPA 10 11 12 i 


1. 


235 


— 


— 


-- 


10 


-- 


-- 


-- 


2. 


236 


— 


— 


~ 


10 


-- 


-- 


-- 


3. 


237 


<o 


Kq.Bo 


-- 


10 


-- 


-- 


-- 


4. 


237 


Kl 


K lt Bi 


- 


11 


- 


- 


- 


5. 


237 


K 2 


K 2 ,B 2 


Krt + OQ 


12 


Ao 


-- 


- 


6. 


237 


K 2 


— 


K l+ B! 


12 


Ao 


Ai 


— 


7. 


237 


<2 


-- 


K 2 +B 2 


12 


A 


Ai 


A 2 
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3.6.3 A COMPLEX MULTIPLY 

An example using both memories, a complex multiply from the FFT (fast 
fourier transform) algorithm, is shown in Example 19. The multiply is 
between a complex signal point held in data memory and a complex 
exponential value (a root of unity, eiO) fetched from table memory. 
The computation is: 



X R =C R * W R - Ci * WI 
Xj-Cr * Wi + CiWr 



Where C is the data point and W is the complex exponential, R and I 
denote real and imaginary parts, respectively. C is in main data 
memory, and W is in table memory. 



Example 19 



Fetch the 


1. 


INCMA 


"Fetch 


C R from data memory 


r 2ur arguments 


2. 


INCTMA 


"Fetch 


■A„ from table memory 




3. 


INCMA: INCTMA 


"Fetch 


Cj fetch Wj 




4. 


FMUL TM.MO 


"Do C R 




Do the 


5. 


FMUL TM.MD: DECTMA 


"Do C R 


* Wj fetch Wj 


multiplies 


6. 


FMUL TM,MD 


"Do Cj 


•Uj 



Do the two 



7. FMUL TM.MD: 0PX(0)<FM 

8. FMUL: 0PX(1)<FM 

9. FMUL: FSUBR FM,DPX(0) 



"Do Cj * W R , save C R W R , in DPX 



"Save C R Wj in DPX 



"Do X R ♦ C R W R -C lWl 



adds 



10. FADD FM.DPXtl) 

11. DPX(0)<FA; FADD 

12. DPXCD<FA 



"DO X T - C R W T * C T W R 



X„ is ready, save in DPX 



<• is ready, save in DPX 
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The total elapsed time is 12 cycles or 2us. In practice, however, all 
but cycles four through seven with the preceding and following 
computations can overlap. The complex multiply then takes only 667ns 
when mixed in with other computations. 

Example 20 summarizes the complex multiply. 



Example 20 





MEMORIES 


MULTIPLIER 


ADDER 


DATA PAD 


CYCLE 


1 TM MD I 


1 Ml.MZ FM 1 


1 A1.A2 


faI 


i 
i 


fl 


1. 




-- 


-- 


-- 


- 


-- 


2. 


— 


— 


— 


-- 


— 


-- 


3. 


— 


.. 


-- 


-- 


-- 


-- 


4. 


W R C R 


V^R 


-- 


-- 


-- 


-- 


5. 


"I C R 


W I' C R 


-- 


- 


-- 


— 


6. 


H, Cj 


Wj.Cj 


-- 


- 


— 


-- 


7. 


\ C I 


a a .c, w R *c R 


-- 


-- 


Vr 


-- 


3. 


-- 


Mj*C R 


-- 


" 


W R C R 


W I C R 


9. 


— 




W I C I* W R C R 


— 


W R C R 


W I C R 


10. 


- 


"r l i 


WrCj.WjC-r 


X R 


w r c r 


W I C R 


11. 


— 


-- 


-- 


X I 


*R 


W I C R 


12. 


.. 


„ 


-- 


-- 


X R 


X, 
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3.7 S-PAD 

The s-pad is a 16-bit wide integer unit used primarily to compute 
memory address pointers and to test loop counters. It is similar in 
capability to a minicomputer and is programmed like the 
register-to-register instructions of the Nova and PDP-11 computers. 
There are 16 registers in the s-pad unit. 



3.7.1 SINGLE OPERAND INSTRUCTIONS 



Table 3-3 lists the single operand instructions, 
chosen from each column. 



One item can be 



Table 3-3 Single Operand Instructions 



OPERATION 


SHIFT 


NO 

LOAD 


DESTINATION 
REGISTER 


INC 


... 


... 


ast; 


DEC 


R 


i 




COM 


L 






CLR 


RR 
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The operation is performed upon the contents of the destination 
register (DST), and that result is shifted. The shifted result is 
stored in the destination register unless a no load (#) is specified. 
The shifted result is the s-pad function (SPFN), which may be stored 
into an address register (MA, TMA, or DPA) or placed onto the data pad 
bus (DB=SPFN). Some examples where SP n refers to the contents of s-pad 
register "n" are illustrated in Example 21. 
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Example 21 



ric 6 

DECR 3 

COM 3; 0PX<SPFN 

CLR* 2; SETDPA 



"(SP 6 +1)-SP 5 

"(SP 3 -l)/2-SP 3 

"5P" 3 -SP 3 *OPX 

"0-<-0PA; because of ? (no load) 
SP 2 remains unchanged 
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3.7.2 DOUBLE OPERAND INSTRUCTIONS 

Table 3-4 lists the double operand instructions, 
chosen from each column. 



One item can be 



Table 3-4 Double Operand Instructions 



OPERATION 


SHIFT 


NO 
LOAD 


DECIMATE 


SOURCE 
REGISTER 


DESTINATION 
REGISTER 


MOV 


— 


— 


... 


src, 


dst, 


ADD 


R 


i 


& 






SUB 


L 










AND 


RR 










OR 












EQV 
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The operation is performed between the source (SRC) and destination 
(DST) registers. If bit reverse (X) is specified, the contents of 
source are bit-reversed before being used. The shift is performed on 
the result which is then stored into the destination register unless no 
load (//) is specified. The shifted result is the s-pad function 
(SPFN), which may be stored into TMA, MA, or DPA or placed onto the 
data pad bus . 



Example 22 



MOV 3,15 

ADDL 6,10; SETMA 
SUB 7,13 

AN0#5,11; SETDPA 
0R# 56,7; SETTMA 
MOVRR 2,2 



SP 3 -SP 1S 

((SP, ) + (SP,)) * 2; SP !0 ^IA 

(SP 13 -SP 7 ) SP 13 

(SP n AND SP 5 )h-DPA 

CSP 7 OR SP 6 (Bit-reversed) )~TMA 

Csp 2 )/<usp, 
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For purposes of program clarity, the assembler allows names to be given 
to the s-pad registers. If register PTR is a pointer to an array in 
data memory, and register STEP contains the increment value used to 
step through the array, then the following instruction word advances 
the array pointer by the proper increment and fetches the next array 
element from memory: 



ADD STEP, PTR; SETMA 
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3.7.3 S-PAD TEST 

The following conditional branches test the s-pad function: 



BR LOOP 

BEQ LOOP 
BNE LOOP 
BGE LOOP 
BGT LOOP 



"branch unconditionally to program 

"location "LOOP" 

"branch if SPFN=0 

"branch if SPFN¥0 

"branch if SPFN>0 

"branch if SPFN>0 



The above branches test the s-pad result from the immediately preceding 
AP instruction. Thus, an s-pad operation must be done one instruction 
cycle before it is desired to test the result. 

An example of loop counting is shown in Example 23. 



Example 23 



DEC 2 
BNE LOOP 



"decrement SP2 

"branch to "LOOP if SP2 has not 

"yet reached zero 



Example 24 tests the contents of SP3 to see if it is between a lower 
limit contained in SP2 and an upper limit in SP4 (i.e., if SP2<SP3<SP4. 
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Example 24 



SUB# 3,2 

SUB 4,3; BGT SMALL 

BGT BIG 



"Too small, SP 3 <SP2 
"Too big, SP 3 >SP4 



The branches are made relative to the current program source address 
with a 5-bit displacement value. This means that the branch target 
address must be within -20g to +17 locations of the current 
instruction. 



3.7.4 AN EXAMPLE 

Example 25 loads data pad X with an array A, with N elements starting 
at main data memory location 3721 8 . C TR is in s-pad register which is 
used as a counter. 



Example 25 



1. 




CLR? CTR: SETDPA 


"Set DPA to 


2. 




LQMA: DB«3721 


'Fetch the first element 


3. 




LDSPI CTR: DB=N 


"Initialize "CTR" to N 


4. 


LOOP: 


IHCMA; DEC CTR 


"Fetch next element, A.+l 


5. 




0PX<M0; 


"Store A 1 into DPX. , adva 






INCDPA: BNE LOOP 


''DPA and test counter 
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Example 26 shows the loop in Example 25 for the N=3 elements. 



Example 26 























MEMORY 




DATA 


°AD 




S-PA n 


INSTRUCTION 
NUMBER 


1 MA 


1 

MI 1 


1 DPA 





I 


1 
21 


"CTR" 


TEST 


1. 


-- 


-- 





— 


-- 


-- 


-- 




2. 


3721 


-- 





-- 


— 


-- 


— 




3. 


-- 


-- 





-- 


-- 


-- 


3 




4. 


3722 


-- 





-- 


-- 


-- 


3 




5. 


- 


A 





Ao 


-- 


-- 


2 


true 


4. 


3723 


- 


1 


Ao 


— 


— 


2 




5. 


- 


*! 


1 


Ao 


Ai 


— 


1 


true 


4. 


-- 


-- 


z 


A 


Ai 


— 


1 




5. 


-- 


Aj 


2 


Ao 


Ai 


A 2 





false 


















1022 



FPS 860-7259-003 



3 - 37 



A generalization on the previous example to fetch array A from every 
Kth memory location is shown in Example 27. The increment is stored in 
s-pad register STEP, and the array pointer is stored in PTR. 



Example 27 



1. LDSPI STEP: DB-K 

2. CLR# CTR; SET DPA 

3. LDMA; DB-BASE 

4. LDSPI CTR: OB-N 



"Initialize "STEP" to K 
"Set DPA to 

"Fetch the first element, A„ 
"Initialize "CTR" to N 



5. LOOP: ADD STEP, PTR: SETHA 
BEQ DONE 



"Advance memory pointer. Fetch 
" next element, Af+1 . Test 
" counter and jump out if 
" done. 



0PX<MD; INCDPA 
DEC CTR: BR LOOP 



7. DONE: 



"Store A. into DPX^ advance DPA 
" Decrement "CTR" and jump 
" back to LOOP. 
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CHAPTER 4 



INTERFACE 



4.1 INTRODUCTION 

This chapter describes the interface between the host computer and the 
AP. The interface is composed of two basic parts: a simulated front 
panel and direct memory access control. The front panel allows the 
host computer to examine or modify the internal AP registers, as well 
as provides for block transfer of data from the host computer to the 
AP, and vice versa. 



4.2 FRONT PANEL 

The AP panel is used for bootstrap operations (loading and starting 
programs) and for debugging user software (inserting hardware 
breakpoints and examining and modifying AP registers and memory) . The 
panel consists of three 16-bit registers which are under the control of 
the host via the host interface. The functioning of these registers 
closely parallels that of the switches and lights on the console of a 
stand-alone computer. The host can examine and/or set these registers 
at any time, regardless of the state of the AP. The front panel and 
host interface is shown in Figure 4-1. 
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Figure 4-1 AP Panel and Host Interface 
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4.2.1 SWITCH REGISTER 

The switch register (SWR) is used to enter data and addresses into the 
AP. The SWR can be read and written by the host computer. An 
executing AP program can also read the switches. 



4.2.2 LIGHTS REGISTER 

The lights register (LITES) simulates front-panel lights and is used to 
display the contents of internal AP registers. This register can only 
be read by the host. The executing AP program can set the lights 
register. 



4.2.3 FUNCTION REGISTER 

The function register (FN) provides front-panel control operations 
(start, stop, continue, etc.). It can be read or written by the host 
The format of the function register is shown in Figure 4-2. 



i 



STOP 



START 



3 I 



5 5 | 7 



CONT 



STEP 



9 | 13 11 12 . 13 14 



RESET 



EXAM 



DEP 



BREAK 



INC 



WORD 



i — i — r 

REGISTER SELECT 



Figure 4-2 Panel Function Register Format 
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When the AP is running, only the STOP and RESET panel functions are 
valid. The other panel functions can only be exercised after the AP 
has halted. The panel functions are described in Table 4-1. 
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Table 4-1 Function Register Bits 



BIT 


MNEMONIC 


EFFECT 





STOP/HALTED 


Stop AP orogram execution upon completion of the current instruction, 
'••(hen the host reads the FN register, this bit reflects the current 
state of the processor. This bit is set if the AP is halted. 
(See note. ) 


1 


START 


Start program execution at the address specified in SWR. 


2 


CQNT 


Continue program execution at the instruction pointed at by PSA (program 
source address). 


3 


STEP 


Execute the instruction pointed at by PSA and then halt. Advance PSA 
to point to the next instruction. 


4 


RESET 


Stop the AP immediately. Clear s-pad register 0. Set SPFN to SP. Dn . 
Clear the AP status register. Stop the host DMA (CTL bit 15 5 D 
set to 0) and clear main data memory timing. 


5 


EXAM 


Examine the register or memory selected by the register select field. 
Display the portion selected by the WORD field in the panel display 
register. 


6 


DEP 


Deposit the contents of the switch register into the register or 
memory selected by the register select field. Deposit into the 
portion selected by the WORD field. 


7 


BREAK 


Enables hardware breakpointing if PSA, MA. or TMA is specified in the 
register select field. The breakpoint causes the AP to halt one instruc- 
tion after any instruction where the contents of the selected register 
was equal to the contents of the switch register. Thus, if a breakpoint is 
specified with PSA selected the AP halts after executing the instruction 
at the program location set 1n the switch register. PSA points to the 
next micro-instruction in sequence. If a breakpoint is called for on 
MA or TMA, the AP halts after executing the instruction following the 
one that referenced the trapped memory location. PSA points to the 
second sequential instruction after the one that caused 
the breakpoint. Memory breakpoints aid in debugging those elusive 
errors that modify memory unexpectedly. 


3 & 9 


INC 


Increment MA, TMA, or DPA following completion of the other specified 
panel functions. This allows sequential memory locations to be examined 
or deposited into. (Refer to Table 4-2.) 


10 & 11 


WORD 


Specifies which portion of a register is being examined or deposited 
into. (Refer to Table 4.3.) 


12 - 15 


REG. SELECT 


Specifies which AP internal register or memory location to examine or 
deposit into. (Refer to Table 4-4.) 



NOTE 

If the current instruction performs a SPIN while waiting for I/O or 
memory, the STOP does not take effect until the spin condition is 
satisfied and the instruction completed. 
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Table 4-2 Bits 8-9 



VALUE IN BITS 8 & 9 


ADDRESS REGISTER 
TO 3E INCREMENTED 





None 


1 


MA (Memory Address) 


2 


OPA (Data Pad Address) 


3 


TMA (Table Memory Address). 
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Table 4-3 Bits 10-11 



VALUE SET IN SITS 10 & 11 


<16-BIT REGISTER 


38-BIT REGISTER 


64-BIT REGISTER 





ALL 


N/A 


Bits 0-15 


1 


N/A 


Exponent Bits 00-09; 
right-justified in 
16-Bit field. 


Bits 16-31 


2 


N/A 


High mantissa 
Bits 00-11; 
right-justified 


Bits 32-47 


3 


N/A 


Low mantissa 


Bits 48-63 
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Table 4-4 Octal Values 



OCTAL VALUE SET 
IN 3ITS 12-15 


REGISTER OR 
MEMORY SELECTED 


DESCRIPTION 


I 


PSA 


Program Source Address 


1 


SPD 


S-Pad Destination Address 


2 


MA 


Main Data Address 


3 


TMA 


Table Memory Address 


4 


OPA 


Data Pad Address 


5 


SPFN 


S-Pad Function (EXAM) 




spspo 


S-Pad address by SPD (DEPOSIT] 


6 


AP STATUS 


AP Internal Status Reg. 


7 


OA 


Device Address Register 


10 


PS TMA 


Program Source Memory addressed by TMA 


11 


IOBS 


Examine I/O device output register 
addressed by DA 


12 


CB 


Control Buffer, Bits 48-63 (EXAM only) 


13 


DPX DPA-4 


Data Pad X addressed by (DPA-4) 


14 


DPY DPA-4 


Data Pad Y addressed by (DPA-4) 


15 


MD MA 


Main Data Memory addressed by MA 


16 


SPFN 


S-Pad Function (EXAM) only 


17 


™TMA 


Table Memory Addressed by TMA (EXAM only) 
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4.3 NOTES ON THE USE OF THE FRONT PANEL AND BREAKPOINT 



4.3.1 WHERE DOES THE AP STOP ON A BREAKPOINT? 



• With the breakpoint set on PSA, the AP stops 
with PSA pointing to the next instruction to be 
executed. 

Thus, breaking on a branch instruction and then 
examining PSA shows whether the branch 
condition is true or false. 



With the breakpoint set on TMA, the AP stops 

with PSA pointing to the second instruction following 

the one that set TMA to the break address. 



With the breakpoint set on MA, the AP stops on 
either the next instruction or the second instruction 
after the one that set MA to the break address, depending 
on the state of the memory lockout hardware (next 
instruction if memory lockout, second instruction if no 
memory lockout). 

Thus, the stopping point following an MA breakpoint 
has a one-instruction uncertainty. 



4.3.2 DOES THE INSTRUCTION ON WHICH THE AP STOPS EXECUTE? 

Since SPFN is current, it is set to the operation specified in the 
instruction that PSA is pointing to. Otherwise, the instruction that 
PSA is pointing to remains unexecuted. It executes correctly when the 
user steps or proceeds from the breakpoint. 
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4.3-3 WHAT ABOUT MD TIMING AND LOCKOUT ON A BREAKPOINT IN THE 
MIDDLE OF AN MD MEMORY CYCLE? 



The hardware is designed so that the AP can be 
stopped in the middle of a memory cycle. The hardware 
remembers where the memory timing is when the AP 
stops so that the processor can continue correctly 
from a breakpoint that occurs during a memory cycle. 



However, the user must not examine MD nor should 
there be any DMA transfers going to or from MD 
while the AP is stopped if the user wishes to 
proceed from the breakpoint. 

Thus, for example, it is possible to break in the 
tight- to-memory portions of the FFT and examine 
data pad or the address registers (PSA, SPA, etc.) 
and then proceed. It is not possible to proceed 
if the user or the host interface disturbs the memory 
timing by reading or writing MD or TM. 



4.3.4 SUMMARY OF THE RULE FOR PROCEEDING FROM BREAKPOINT 

If the breakpoint causes the AP to stop in the middle of the memory 
cycle (PSA pointing to first or second instruction following SETMA, 
INCMA, DECMA, or LDMA) , the user should not try to examine or modify 
MD. 



4.3.5 WHAT ABOUT STEPPING THE AP? 

The same rules for proceeding from a breakpoint apply to stepping the 
AP through a program. The user can examine and modify any register of 
memory within the constraints mentioned in section 4.3.4. 
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4.3.6 WHAT OTHER PITFALLS ARE THERE IN THE USE OF THE VIRTUAL 
FRONT PANEL? 



Note that the panel always examines SPFN, not SP . 

Thus, the user must force SPFN ■ SPgpn to see 

SPgpD' This can most easily be done via the 

panel reset function which has the side effect of also 

clearing SP(0). 



To examine TM, the user should first set TMA and 
then do a dummy panel operation (deposit TMA again, 
for example) in order to enter the output of table 
memory into the table memory buffer register. The 
user can then proceed to examine the addressed 
location using the appropriate panel functions. 



MD: setting MA from the panel initiates an MD memory 
read cycle. Depositing into MD from the panel 
initiates an MD memory write cycle. 

Thus, to write MD and then examine what was just 
written, the user must perform a deposit into MA 
operation (with the same address) to initiate a 
read cycle before examining MD. 



Using the increment field in the FN register: 

DPA and TMA always increment after the EXAM or DEP 

operation is complete (remember that TMA is used 

to address program source memory for panel operations) 

MA post-increments and initiates a new memory read 
cycle on an EXAM operation. 

MA pre- increments on a DEP operation. 
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• The recommended procedure for starting the AP is as 
follows : 



1. Set the SWR to the starting address and do a 
deposit into PSA. 

2. Set the SWR to the desired breakpoint and do a 
continue to start the AP. 



This procedure has the significant advantage of placing 
the necessary breakpoint code into the user's program 
should the AP program need debugging. 



The panel START function can be used, but the user should 
observe the following restrictions on the first 
instructions executed by the AP. The first instruction 
should not branch, jump, or modify PSA in any way other 
than to advance to the next instruction. The first 
instruction should not use the SPEC and I/O fields. In 
fact, the preferred first instruction is a NOP (all 
zeros). 
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4.4 DIRECT MEMORY ACCESS 

In addition to the panel function, the AP contains four 16-bit 
registers that are used for direct memory access (DMA) to both host and 
AP data memory, plus a 38-bit format conversion register that acts as a 
buffer between the two memories. These registers may be read and/or 
loaded from either the host computer or the AP. 



4.4.1 HOST MEMORY ADDRESS REGISTER 

The host memory register (HMA) points to consecutive locations in the 
memory of the host computer. It operates in either auto-increment or 
auto-decrement mode during DMA transfers to and from host memory. HMA 
is device address 1 for AP internal I/O transfers. 



4.4.2 WORD COUNT REGISTER 

The word count register (WC) counts the number of host memory words 
transferred in a DMA operation. It is preset to the desired number of 
words to be transferred and counts down as the transfer proceeds, 
stopping the DMA transfer when it reaches zero. Hardware logic 
prevents this register from being counted past zero. WC has AP device 
address 0. 
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4.4.3 AP DIRECT MEMORY ADDRESS REGISTER 

The AP direct memory address register (APDMA) points to consecutive 
locations in AP main data memory during DMA transfers to and from MD. 
This register can operate in either auto-increment or auto-decrement 
mode. APDMA has AP device address 3. 



4.4.4 CONTROL REGISTER 

The control register (CTL) acts as a control over the DMA and interrupt 
functions of the host interface. This register controls the direction 
and mode of transfer (DMA or program control) and the type of data 
format and provides certain bits of status information pertaining to 
the transfer. CTL has AP device address 2. The format of the control 
register is shown In Figure 4-3. The bit descriptions are contained in 
Table 4-2. 






1 


2 


3 


* 


5 


6 


7 


3 


9 


10 


11 


12 


13 14 


15 , 


WC=3 


INTR 

AP 


IAP 
WC 


IH 
HALT 


IH 

WC 


IH 
ENB 


FERR 


DLATE 


CC 


AP 
DMA 


WRT 
HOST 


DEC 
APF1A 


DEC 
HMA 


1 
FMT 

1 


HDMA 
START 
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Figure 4-3 DMA Control Register Format 
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Table 4-5 DMA Control Register Description 



BIT 


MNEMONIC 


EFFECT 








3 


wc = a 


Indicates that the word count register is zero. Note that WC is 
decremented only during DMA transfers to/from host memory tread 
only bit). Should not be used to monitor DMA activity. 


1 


INTR AP 


Sets the INTRQ (interrupt request) flag in the AP. 


2 


IAPWC 


Sets INTRQ (.interrupt request) flag in the AP when the DMA transfer 
1s done. 


3 


I HALT 


Enables a host interrupt when the AP halts. 


4 


IHWC 


Enables a host interrupt when the DMA transfer is done. 


5 


IHENB 


Interrupt Host Enable. Interrupt Host if AP attempts to set this Bit. 
This bit can actually be written only by the Host. (This is not 
suDDorted on all host systems.) 


6 


FERR 


Format error. Indicates that exponent underflow or overflow occurred 
in conversion from AP format to host floating-point format. 


7 


DLATE 


Data late. Indicates that the AP did not empty the format buffer 
before the host attempted to reload it. On some hosts this bit also 
indicates an attempt to access non-existent host memory. In either 
case the DMA transfer is terminated. 


8 


CC 


Consecutive cycle. Block DMA transfers to/from host memory occur 
without interruption. On typical hosts, the host CPU is locked out 
but other higher priority DMA devices still have access to host 
memory. 


9 


APDMA 


Allows the interface to perform DMA transfers to/from AP memory. 
Depending on the direction of transfer, a main data memory cycle is 
initiated every time the host finishes reading or loading the format 
register, whether via DMA or program control. On the AP side, the 
format register is loaded from the main data bus instead of the data 
pad bus. 


10 


WRTHOST 


Write to host. This bit controls the direction of transfer. If set, 
data is read from the AP, passed through the format register, and 
written to the host. If clear, the direction of transfer is reversed. 


11 


OECAPMA 


Decrement APMA. If set, APMA is decremented during DMA transfers 
to/from AP Main Data memory. If clear, APMA is incremented. (This 
caoabilitv is not oresent on all host systems.) 


12 


DECHMA 


Decrement HMA. If set, HMA 1s decremented during DMA transfers to/ 
from host memory. If clear, HMA is incremented. 


13 4 14 


FMT 


Format Register Control. (See note.) 


15 


HDMA 
start/busy 


Host DMA start. Initiate DMA transfers to/from host memory. When read, 
the state of this bit reflects the status of the host DMA activity 
('1' if active, '0' 1f inactive). Transfers continue until WC » 0. 



NOTE 

The format register mode of operation is controlled entirely by 
bits 9, 10, 13 and 14 of the control register. Thus, even the host 
and the AP can load and read the format register via program 
control I/O transfers at any time. The programmer must be sure 
that the type of transfer he performs is consistent with these 
bits of CTL for the transfer to be meaningful. (Refer to Table 4-6.) 
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Table 4-6 Bits 13-14 



VALUE IN SITS 13 i 14 


FORMAT TYPE 





32-Si t Integer. No format conversion. Used to 
transfer integers or program half-words. 


1 


16-Bit Integer. 16-bit integers from host are 
converted to unnormalized 38-bit AP FPNs. Low 
16-bits of AP FPN are sent to host. 


2 


Conversion of "signed-magnitude mantissa with 
binary exponent" format to/from AP floating 
point format. Includes logic to handle 
"phantom bit" formats. 


3 


Conversion of IBM 32-bit format to/from AP 
format. IBM format can be specified to have 
either sign-magnitude or two's complement 
mantissa. 



NOTE 

For format types 2 and 3, the format register has the necessary logic 
to detect overflow and underflow on conversion from AP format and to 
force a signed maximum quantity on overflow or floating point zero on 
underflow. 
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4.5 FORMAT CONVERSION REGISTER 

This 38-bit double-buffered register is used for all transfers of 
floating-point numbers (FPNs) between the host and the AP. It also 
provides the most efficient path for transfer of microcode half-words 
(32 bits). It performs bi-directional format conversions under the 
direction of bits 9, 10, 13, and 14 of the GTL register. The 
programmer must be aware of the fact that the format conversion is a 
slave to these CTL bits. Nonsence results if transfers to and from the 
formatter are not consistent with these CTL bits. The host and AP can 
read the output of the formatter at any time without restriction; 
however, the input to the formatter is controlled by CTL bits 9 and 10. 



Table 4-7 CTL Register Bits 9-10 



CTL09 


CTL10 


INPUT PATH TO FORMATTERS 


0,1 





Host Data Bus 


a 


1 


AP I/O Bus 


i 


1 


AP Main Data Output 
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The formatter has a ready indicator that can be sampled by the AP. 
This indicator tells the AP when to load new data into the formatter 
(CTL10=1) and when to read data from it (CTL10-0) after the host has 
finished reading or loading the last 16-bit word of a FPN. 

Note that in 16-bit host computers, the interface expects to receive 
words in different order depending on CTL bit 12 (DECHMA). If bit 12 
is clear (i.e., the host DMA interface is going through memory in 
forward order from low to high addresses), then the interface expects 
to receive the high word of an FPN followed by the low word. If bit 12 
is set, the interface expects to receive the low word followed by the 
high word. This is done so that arrays of FPNs are always stored in 
forward order in host memory. 

If the format CTL bits (bits 13 and 14) specify a 16-bit transfer 
(FMT=1) then the integer is loaded and read from the low word of the 
formatter. That word is considered to be the last word transferred. 

There is no corresponding indicator to the host since the AP can 
transfer data to and from the formatter faster than most host 
processors. The DLATE bit in the CTL register (CTL bit 7) does 
indicate when an error of this type occurs (i.e., when the host 
transfers data faster than the AP). 
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4.6 AP INTERNAL INTERFACE TO HOST INTERFACE 

The registers in the host interface are accessible to the AP via its 
input/output (I/O) instructions (FADD-7). 



Table 4-8 AP Device Address for Host Interface Registers 



I/O DEVICE 


DEVICE ADDRESS 


HOST INTERFACE 

DMA REGISTERS: 




WORD COUNT REGISTER (WC) 





HOST MEMORY ADDRESS REGISTER (HMA) 


1 


CONTROL REGISTER (CTL) 


2 


AP MEMORY ADDRESS REGISTER (APMA) 


,-. 3 


FORMATTER (FMT) 


4 



WRITABLE TABLE MEMORY (TMRAM) 



PAGE SELECT SELECT OPTION 

MEMORY ADDRESS EXTENSION (MAE) 


30 


APMA EXTENSION (APMAE) 


31 


MASK (including MODE and I/O) 


32 



ADDITIONAL DEVICE ADDRESSES: 
First I0P16 


10-14 


Second I0P16 


20-24 


Parity Option 


33-37 


First PIOP 


100, 101, 110-117 
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An IN, OUT, or SNSA instruction at DA=4 (FORMAT) generates an IODRDY 
response if the format register is ready to accept data from the AP 
(CTL bit 10=1) or if it has formatted data ready for the AP (CTL bit 
10=0). If CTL bit 9 is 1, the AP cannot load the formatter via I/O 
instructions since the input multiplexer to the format register is set 
to select main data instead of the AP I/O bus. Note that the AP cannot 
change the state of CTL bit 5. An interrupt of the host is generated 
if it attempts to set this bit when the bit has already been set by the 
host. The AP can read the CTL at any time without interf erring with 
the host interface. If both the host and the AP try to write CTL or 
access HMA, WC, or APMA at the same time, the host selection and data 
has priority over that of the AP. 

Access to the format conversion register is controlled by CTL bits 9, 
10, 13, and 14. Refer to section 4.4 for a description of the function 
of these bits. 
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4.7 AN EXAMPLE OF LOADING PROGRAMS INTO THE AP 

Loading and running a program in the AP from a cold start is a 
five-step process which illustrates use of the front panel. 



1. Using the AP front panel from the host computer, 
finger switch in a three-instruction bootstrap 
program into program memory. 

2. Start the bootstrap running. 

3. Set the address in the AP where the loaded 
program is to go. 

4. Start a DMA transfer of program words from 
host computer memory to the AP. The bootstrap 
program running in the AP stores these words 
into program memory. 

5. When the DMA transfer is done, stop the bootstrap 
program in the AP and then restart the AP 
executing the newly-loaded program. 



These five steps are detailed in the remainder of Chapter 4. DMA 
control and front panel interrogation is done from the host computer by 
setting various interface registers. The actual host computer I/O 
instructions to accomplish this, of course, depend upon the particular 
host computer. For the purposes of this explanation, the indicated 
numbers are loaded into a designated interface register in order to 
accomplish the desired goals. 
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Step 1: 

For the purpose of this example, the bootstrap program is put into 
program source memory locations 0, 1, and 2. 

1. Set TMA to (TMA is the pointer used by the panel 
functions for examining or depositing into program 
memory) : 

— ► SWR Put into the switches. 
1003 — ► FN Put 1003 into the function register 

(causing a deposit into TMA). 



2. Put bits 0-63 of bootstrap program program word no. 1 
into program memory location using four deposits 
of SWR — ►PSxMA* 



(bits 0-15) — ►SWR Put bits 0-15 into the switches. 
1010 — ►FN Put 1010 into the function register 

(causes a deposit into bits 

0-15 of PSxma). 



(bits 16-31) — ►SWR Put bits 16-31 into bits 
1030 ►FN 16-31 of PSxma. 



(bits 32-47) — ►SWR Put bits 32-47 into 32-47 
1050- — ►FN of PSxma. 



(bits 48-63) — ►SWR Put bits 48-63 into bits 
1370 — ►FN 48-63 of PS?^ and 

increments TMA to point to 

location 1. 



3. Repeat the second and third bootstrap program 
words in no. 2 above. 



It is necessary to perform these steps only once. 
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Step 2: 

Set the address in the AP program memory where the program is to be 
loaded by the bootstrap into TMA. For this example, this address is 
200: 

200 — ► SWR Put 200 in the switches. 
1003 — ► FN Put 1003 into the function register 
(causes a deposit into TMA). 



Step 3: 

Start the bootstrap program running in the AP. 

Set the switches to and do a start. 

— ►SWR 

40000 — ►FN Start the AP at location 0. 



The bootstrap program (as demonstrated in step 4) spins while waiting 
for words to come across the DMA from the host computer. 



Step 4s 

Start the DMA transfer from host memory into the AP. For this example, 
it is assumed that the program is in host memory at location 20000. 
The program to be loaded is 200 AP program words (or 800 16-bit host 
words) long. The actual host memory location and length could be any 
particular value. 



20000 — ►HMA Set host DMA address to 20000. 

800 ► WC Set word count to 800 host words 

(assuming a 16-bit host word width). 
201 ► CTL Start the DMA. 



Note in particular the CTL bits. Bit 15 initiates the DMA and bit 8 
requests consecutive memory cycles from the host- By not setting bits 
10 or 11, the transfer is set to go to the AP, but not into main data 
memory. Instead, the data goes only as far as the formatter which the 
bootstrap reads. If bit 4 is set, the host computer is interrupted 
when the DMA is done. 
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Step 5: 

Finally, the three-word bootstrap program is ready to run in the AP. 

1. LDDA; DB-4 "set DEVICE ADDRESS to 4 



This instruction sets the device address register so that future I/O 
instructions refer to device no. 4, which is the DMA formatter (where 
the data from the host computer ends up). 



2. LOOP:SPININ; "wait for some data 
DB=INBS; "get the data 
LPSLT "put it into the left half of P.S. 



The SPININ causes the processor to hang until the current I/O device 
address (in this case, the DMA formatter) has some new data. Then, to 
read that data, the DB=INBS puts the input data onto the data pad bus. 
The LPSLT puts what is on the data pad bus into the left half (bits 
through 31) of the program memory location pointed at by the TMA 
register. 

Two points should be considered: 

• The formatter is 32 bits wide on the AP end; every time 
the interface receives 32 bits of data from the host 
computer, the SPIN stops waiting, and another 32 bits of 
data are processed. Since the program words loaded are 
64 bits wide, they are halved (left, right, left, right, 
etc.) and stored accordingly into program memory. 



TMA is used as a pointer indicating where the bootstrap 
should place the program it is loading; thus, the LPSLT 
puts the program words into the proper place. 



3. SPININ; "wait for data 

DB=INBS; "get the data 

LPSRT; "put it into the right half 

INCTMA; "increment pointer 

BR LOOP. "go back for more 



This does basically the same as no. 2 above except that this processes 
the right half (bits 32-63) of a 64-bit program word. The INCTMA 
increments the storing pointer so instruction no. 2 stores its data 
into the next word. The branch uses loop waiting for more program 
half-words. 
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Step 6: 

Back in host, waiting for the DMA transfer is accomplished by: 

• reading the CTL register 

• testing for bit 15 (the LSB) equal to 1 

• if so, going back to step 1 

Enabling a host interrupt on DMA completion is also possible. 

When DONE, the bootstrap program is stopped (which otherwise would run 
forever) with a panel RESET function, and the newly-loaded program is 
started (example starts at location 200): 



"reset the AP 
"new program address 
"set 200 into PSA 
"continue (from 200) (i.e., start 
at AP location 200) 



To set a program breakpoint, the user can set the breakpoint address 
into the SWR and use 20400 (continue + break on PSA) for the final 
panel function. 



4000 


— ►] 


FN 


200 


— ► 


SWR 


1000 


— ► 


FN 


20000 — * 


' FN 



NOTE 

The simplest way for the running AP 
program to indicate to the host computer 
that it is done with its task is to HALT. 
When this happens, bit in the panel 
function register is set (which the host 
can test for) or a host interrupt can be 
enabled (CTL bit 3). 
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APPENDIX A 



AP REGISTERS /DATA PATH NAMES 



Table A-l Registers and Data Paths 



mnemonic 


width 


SP 


16 bits 


SPD 


4 


SPFN 


16 


PNBLS 


16 


SWR 


16 


LITES 


16 


APSTATUS 


16 


PS 


64 


CB 


64 


PSA 


16 


SRS 


16 


SRA 


16 


DPX 


38 


DPY 


38 


DB 


38 


DPA 


16 


TM 


38 


TMA 


16 


MD 


38 


MI 


38 


MA 


16 


Al 


38 


A2 


38 


FA 


38 


Ml 


38 


M2 


38 


FM 


38 


IODEVICE 




DA 


16 


INBS 


38 


IODRDY 


1 


A 


1 


B 


1 



name 

scratch pad registers (16) 

s-pad destination address register 

scratch pad ALU/shifter function output 

panel bus 

panel switch register 

panel display register 

AP status register 

program source memory 
command buffer 

program source address register 
subroutine return stack 
subroutine return stack pointer 

data pad X registers (32) 
data pad Y registers (32) 
data pad bus 
data pad address register 

table memory output register 
table memory address register 

data memory output register 
data memory Input register 
memory address register 

floating adder input register no. 1 
floating adder input register no. 2 
floating adder output register 

floating multiplier input register no. 1 
floating multiplier input register no. 2 
floating multiplier output register 

I/O device 

I/O device address 

I/O input bus 

I/O data ready flag 

I/O device condition A flag 

I/O device condition B flag 
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Subscripts indicate addressing within memory element (i.e., PSpg^ means 
the location in program source memory pointed to by the program source 
address register). 

Superscripts indicate portions of word (i.e., A2 E means the exponent 
portion of the A2 register). 

Parentheses around a symbol indicates the contents of a register (i.e., 
(Al) means the contents of the Al register). 



Table A-2 AP Internal Status Register 



» 


1 


9 


3 


4 


5 


5 


7 


8 


3 


10 


11 


12 


13 14 15 | 


OVF 


UNF 


3IVZ 


FZ 


FN 


Z 


N 


C 


PERR 


PENB 


SRAO 


IFFT 


FFT 


1 1 

BIT REVERSE 

1 1 



1034 



bits mnemonic 



OVF 



UNF 



meaning 

Set when the current adder or multiplier 
(FA or FM) has overflowed. Overflow 
occurs when an exponent value is increased 
above 511. The offending result is set to 
the signed maximum of value of (l-2"*27) * 
2511, which is roughly 6.7 * 10153. This bit 
remains set until cleared by the microprogram 
or host computer. 

Set when the current adder or multiplier 
result (FA or FM) has underf lowed. Underflow 
occurs when an exponent value is 
decreased below -512. The minimum legal 
magnitude which numbers can take without 
underf lowing is roughly 3.7 * 10~155. 
The offending value is set to zero. This bit 
remains set until cleared by the microprogram 
or host computer. 
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Table A-2 Internal Status Register (cont.) 



bits 



mnemonic 



DIVZ 



FZ 



meaning 

A divide by zero has occurred. The result 
was set to the value of the dividend. This 
bit remains set until cleared by the 
microprogram or host computer. 

Set when the current adder result (FA) 
is zero. 



FN 



Set when the current adder result (FA) 
is negative. 

Set when the current s-pad function (SPFN) 
is zero. 



N 



PERR 



10 



PENB 



SRAO 



Set when the current s-pad function (SPFN) 
is negative. 

S-pad carry bit. If no s-pad shift is 
specified, carry is the carry bit from the 
s-pad ALU. If a shift is specified, carry 
is the last bit shifted off the end of the 
s-pad result by the shift. 

(Optional). Set when a main data memory parity 
error has occurred. Three parity bits are 
used, one each to check the exponent, high 
mantissa, and low mantissa portions of the memory 
word. If PENB is set, the processor 
halts on this error. (See Page Select/ 
Parity Option Manual (FPS 860-7365-000) 
for more information. ) 

(Optional). Enables halt on memory parity error. 
If set, the processor halts when a memory 
parity error is detected. 

Subroutine return stack overflow. Set if 
more than 16 levels of nested subroutine 
calls occur. 
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Table A-2 Internal Status Register (cont.) 



bits mnemonic meaning 

11 IFFT Inverse FFT flag. When set in conjunction 

with the FFT flag, bit 12, roots of 
unity table references are interpreted 
as positive angles. 

12 FFT FFT flag. When set, table memory 

addresses are interpreted as negative 
angles referencing the roots of unity 
table contained in table memory. 

13-15 bit 15-Log2N where N is the length of a 

reverse complex data array to which the s-pad address bit 
reverse operator is being applied. 
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3US INPUTS: 




3PBS - Data 


Paa 3us (38) 


DFX 
SPY 


PS VALUE 
SPfN ZERO 


INBS 
■ID 


SWR 

71 



HOST 
COMPUTER 



ll/O DEVICE 



FUNCTIONAL UNIT OUTPUTS: 



I/O DMA . I 

-S-B- 



INBS - Input 3us (38) Formattter 
PNLBS- Panel Bus (16) 




OPX 
3PY 
MD 

tm 

FA 
FM 

5PFN 
DMA 

PS 



- Data Pad X Outout 

- Data Pad Y Outout 

- Data Memory Output 

- Table Memory Output 

- F.P. Adder Output 

- F.P. Multiplier Output 

- S-Pad ALU Outout 

- Direct Memory Address 
Program Source Output 



INBS - Input Sus 



S-PAD 
REGISTERS 



FLOATING 
POINT 
MULTIPLIER 
M1*M2 



STAGE 1 



STAGE 2 



STAGE 3 



FLOATING 
POINT 
COND. 
3 ITS 



FLOATING 

POINT 

ADDER: 

A1+A2 

A1-A2 

A2-A1 

ABS(A2) 

Al EQV A2 

Al AND A2 

Al OR A2 

FIX A2 



FLOATING 
POINT 
CONDITION 
BITS 



STAGE L 



STAGE 2 



F2 

FN 
FU 
FO 



FM 



BIT 
REV 



INTEGER 

ALU/ 
SHIFTER 



SPFN 



FA 



S-PAD ALU FUNCTIONS 

S_~D 

S -D 

S+l -D 

S-l -0 

D+S -0 

D-S -D 

D AND S -D 

D OR S -0 

EQV S -D 



S-PAD SHIFTER FUNCTION 

* 2 
+ 2 

+ 4 



INTEGER CONOITION BITS 



PBSf 
N 



DATA PAD ADDRESS 



DPBS 



SPFN 



MEMORY ADDRESS 



DPBST TABLE MEMORY ADDRESS 

■swim — 



JMA 



SS— ft PROGRAM SOURCE ADDRESS 

PNLBS' 



>DPA 
. MA 
»TMA 
► PSA 



L335 



Figure A-l AP Functional Units 



FPS 860-7259-003 



A - 



Table A-3 AP Instruction Summary 



UNCONDITIONAL FIELDS Each of the following fields may be used 1n any given Instruction word. 



OCTAL 






















OCTAL 


CODE 










FIELD NAME 










COOE 




B 


SOP 


S0P1 


SH 


SPS 


SPO 


FADD 


FADD1 


Al 


A2 







NOP 


S0P1 


NOP 


NOP 


(S-PAD 


(S-FAD 


FADD1 


NOP 


NC 


NC 





1 


& 


SPEC 


WRTEXP 


L 


Source 


Dest. 


FSUBR 


FIX 


FM 


FA 


1 


2 




ADD 


WRTHMN 


RR 


Reg.) 


Reg.) 


FSUB 


FIXT 


DPX 


DPX 


2 


3 




SUB 


WRTLHN 


R 






FADD 


FSCLT 


DPY 


DPY 


3 


4 




MOV 


NOP 




(0-17) 


(0-17) 


FEQV 


FSM2C 


TM 


MD 


4 


5 




AND 


NOP 








FAND 


F2CSM 


ZERO 


ZERO 


5 


6 




OR 


NOP 








FOR 


FSCALE 


ZERO 


MDPX 


6 


7 




EQV 


NOP 








10 


FABS 


ZERO 


EDPX 


7 


10 






CLR 
















10 


11 






INC 
















11 


12 






DEC 
















12 


13 






COM 
















13 


14 






LDSPNL 
















14 


15 






LDSPE 
















15 


16 






LDSPI 
















16 


17 






LOSPT 
















17 



OCTAL 






















OCTAL 


CODE 










FIELD NAME 










CODE 




COND 


DISP 


DPX 


DPY 


DPBS 


XR 


YR 


XW 


YW 


FM 







NOP 


(Branch 


NOP 


NOP 


ZERO 


(DPX 


(DPY 


(DPX 


(DPY 


NOP 





1 


# 


Displace- 


DB 


DB 


INBS 


Read 


Read 


Write 


Write 


FMUI, 


1 


2 


BR 


ment) 


FA 


FA 


VALUE* 


Index) 


Index) 


Index) 


Index) 




2 


3 


BINTRQ 


(0-37) 


FM 


FM 


DPX 












3 


4 


BION 








DPY 


(0-7) 


(0-7) 


(0-7) 


(0-7) 




4 


5 


BIOZ 








MD 












5 


6 


BFPE 








SPFN 












6 


7 


RETURN 








TM 












7 


10 


BFEQ 




















10 


11 


BFNE 




















11 


12 


BFGE 




















12 


13 


BFGT 




















13 


14 


BEQ 




















14 


15 


BNE 




















15 


16 


BGE 




















16 


17 


BGT 




















17 



OCTAL 
CODE 


FIELD NAME 


OCTAL 
CODE 




Ml 


M2 


MI 


MA 


DPA 


TMA 





1 
2 
3 


FM 
DPX 
DPY 
TM 


FA 
DPX 
DPY 
MD 


NOP 

FA 
FM 
DB 


NOP 
INCMA 

DECMA 
SETMA 


NOP 

INCDPA 
CECDPA 
SETDPA 


NOP 

INCTMA 
DECTMA 
SETTMA 



1 
2 

3 



This Instruction uses a 16-bit Immediate VALUE as a constant or address (1n bits 48-63 
of this Instruction). The YW, FM, Ml, M2, MI, TMA and DPA fields are then disabled for 
this instruction word. 



0627 



FPS 860-7259-003 



A - 



Table A-4 SPEC Fields 



SPEC FIELDS One of the SPEC Fields may be used per Instruction word. The 3-pad Fields C D . 



SOP, 


S0P1, SR, 


SPS, and 


SPD). are 


then disabled for this 


instruct 


on. 






OCTAL 
CODE 








FIELD 


NAME 








OCTAL 
CODE 




SPEC 


STEST 


HOSTPNL 


SETPSA 


P SEVEN 


PSODD 


PS 


SETEXIT 






1 


STEST 


BFLT 


PNLLIT 


OMPA* 


RPS0A* 


RPS1A* 


RPSLA* 


NOP 





HOSTPNL 


BLT 


DBELIT 


OSRA* 


RPS2A* 


RPS3A* 


RPSFA* 


SETEXA* 


1 
2 


2 


SPMDA 


BNC 


DBHLIT 


JMP* 


RPS0* 


RPS1* 


RPSL* 


NOP 


3 


NOP 


BZC 


DBLLIT 


JSR* 


RPS2* 


RPS3* 


RPSF* 


SETEX* 


i 


4 


NOP 


BDBN 


NOP 


JMPT 


RPS0T 


RPS1T 


RPSLT 


NOP 


i 


5 


NOP 


BDBZ 


NOP 


JSRT 


RPS2T 


RPS3T 


RPSFT 


SETEXT 


5 


5 


NOP 


BIFN 


NOP 


OMPP 


NOP 


NOP 


RPSLP 


NOP 


6 


7 


NOP 


BIFZ 


NOP 


JSRP 


NOP 


NOP 


RPSFP 


SETEXP 


/ 


10 
11 


SETPSA 


NOP 


SWDB 


NOP 


WPS0A* 


WPS1A* 


LP SLA* 


NOP 


10 


P SEVEN 


NOP 


SWDBE 


NOP 


WPS2A* 


WPS3A* 


LPSRA* 


NOP 


11 
12 


12 


PSODD 


NOP 


SWOBH 


NOP 


WPS0* 


WPS1* 


LPSL* 


NOP 


13 
14 


PS 


NOP 


SWDBL 


NOP 


WPS2* 


WPS3* 


LPSR* 


NOP 


13 


SETEXIT 


BFL0 


NOP 


NOP 


WPS0T 


WPS1T 


LPSLT 


NOP 


14 


15 


NOP 


BFL1 


NOP 


NOP 


WPS2T 


WPS3T 


LPSRT 


NOP 


15 


16 


NOP 


BFL2 


NOP 


NOP 


NOP 


NOP 


LPSLP 


NOP 


16 


17 


NOP 


BFL3 


NOP 


NOP 


NOP 


NOP 


LPSRP 


NOP 


17 



This instruction uses a 16-bit integer VALUE (in bits 48-63 of the instruction word). The 
YW, FM, Ml, M3, MI, MA, TMA, and PDA Fields are then disabled for this instruction word. 



Table A-5 I/O Fields 



I/O FIELDS One of the I/O fields may be used per instruciton word. The floating adder fields 
(FADD, FADD1, Al, and A2) are then disabled for this instruction word. 



OCTAL 
CODE 



10 



LDREG 



RDREG 



LDREG 


NOP 


RPSA 


RDREG 


LDSPO 


RSPO 


SPMDAV 


LDMA 


RMA 


NOP 


LDTMA 


RTMA 


INOUT 


LDDPA 


RDPA 


SENSE 


LDSP 


RSPFN 


FLAG 


LDAPS 


RAPS 


CONTROL 


LDDA 


RDA 



FIELD NAME 



INOUT 



SENSE 



FLAG 



CONTROL 



OUT 

SPNOUT 

OUTDA 

SPOTDA 

IN 

SPININ 

OUTDA 

SPINDA 



SNSA 

SPININ 

SNSADA 

SPNADA 

SNSB 

SPINB 

SNSBDA 

SPNDBA 



SFL0 
SFL1 
SFL2 
SFL3 
CFL0 
CFL1 
CFL2 
CFL3 



HALT 

IORST 

INTEN 

INTA 

REFR 

WRTEX 

WRTMAN 

NOP 



OCTAL 
CODE 
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APPENDIX B 



INSTRUCTION SUMMARY 



FPS 860-7259-003 B - 1 
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Table B-2 S-pad Group 



2 


1 3 


4 5 


5 




9 


10 




J. J 


3 


SOP 


3H 


SPS 


SPO 








S0P1 









FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT 


3 





- 


No-op 




1 


■1 


Use SPj-pc (bit-reversed) 


SOP 





- 


See SOP1 field 




I 


- 


See Special Operations Group 




2 


ADD 


CSP Spo ) + (SP sps )-*SPFN 




3 


SUB 


(3P spo ) - (SP Sps )— SPFN ' 




4 


MOV 


(SP sps )— »SPFN 




5 


AND 


(SP spD ) AND (SP sps )— *SPFN 




6 


OR 


(SP spD ) OR (SP Sps )— ►SPFN 




7 


EQV 


(SP spo ) x5r (sp sps )~ *SPFN 


SH 





- 


No-op 


(see NOTE) 


1 


L 


SPFN*2— »SPFN (left shift) 




2 


RR 


SPFN+4— ►SPFN (double right shift) 




3 


R 


SPFN*2— »SPFN (right shift) 


SPS 


0-17 8 


0-l7g 


S-Pad Source Operand Address 


SPO 


0-17g 


0-17 a 


S-Pad Destination Address, SPFN ► 

SP SPD unless inhibited by No Load 
(COND = 1) 



NOTE 
These are logical shifts: 
Right shift Q 

Left shift jjH Q 



0-15 



E 



0-15 
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Table B-2 S-pad Group (cont.) 



FIELD 


OCTAL 

CODE 


MNEMONIC 


EFFECT (see NOTE) 


S0P1 





- 


No-op 




1 


WRTEXP 


Restricts DPX, DPY 1 MI fields to 
Write Exponent Only 




2 


WRTHMN 


Restricts DPX, DPY & MI fields to 
Write High Mantissa Only (Bits 00-11) 




3 


WRTLMN 


Restricts DPX, DPY & MI fields to 
Write Low Mantissa Only (Bits 12-27) 




4 


- 


- 




5 


- 


- 




6 


- 


- 




7 


- 


- 




10 


CLR 


— »SPFN 




11 


INC 


(SP spD ) + 1-*SPFN 




12 


DEC 


(SP spo ) - 1— SPFN 




13 


COM 






(SP spD ) — *SPFN logical complement 




14 


LDSPNL 


SP Spo — ♦SPFN, PNLBS— ►SPjpj, 




15 


LDSPE 


SP sp[| — fSPFN, DB E - 512— ►SPjpp 




16 


LDSPI 


S P spD _* SPRt , DB ML ^SP spo 




17 


LDSPT 


MT 

SP SPD-* SPFN - DB — SP SP0 



NOTE 

MH = Mantissa High » Mantissa bits 00-U 

ML = Mantissa Low =■ Mantissa bits 12-27 

MT = Mantissa bits for table lookups » Mantissa bits 02-08 

E - Exponent 
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Table B-3 Special Operations Group 



1 1 




3| 


I 6 




9 


13 




13 


1 3 


a 


1 | 


| 


SPEC 




STEST 














HOSTPNL 




SETPSA 




P SEVEN 




PSODO 




PS 




SETEXIT 



FIELD 



OCTAL 
CODE 



MNEMONIC 



EFFECT 



SPEC 



10 



11 



12 



13 



14 



15 



16 



17 



SPMDA 



See STEST Field (B-6) 



See HOSTPNL Field (B-7) 



Spin until MD available 



See SETPSA Field, inhibit TEST except 
Mo Load (3-8) 



See PSEVEN Field (B-9) 



See PSODD Field (B-10) 



See PS Field (B-ll) 



See SETEXIT Field (8-12) 
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Table B-3 Special Operations Group (cont.) 



FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT (sae NOTE) 


STEST 


3 


3FLT 


Branch if FA<0.0 




1 


BLT 


Branch if SPFN<0 




2 


BNC 


Branch if S-Pad carry Bit » 1 




3 


BZC 


Branch if S-Pad carry bit * i 






i* 


BDBN. 


Branch if OB < 0.i3 




5 


BDBZ 


Branch if DB positive and unnormal 


ized 




5 


8IFN 


Branch if Inverse FFT flag = 1 




7 


BIFZ . 


Branch 1f Inverse FFT flag »' 




13 ■ 


- 


- 






11 


- 


- 




12 


- 


- 




13 


- 


- 




14 


BF10 


Branch if Flag =1 






IS 


BFL1 


Branch if Flag 1 = 1 




16 


3FL2 


Branch if Flag 2 = 1 




17 


BFL3 


Branch if Flag 3=1 





NOTE 

If the above specified condition is true OR 
the condition specified in the COND field is 
true, a branch occurs to (PSA) + OISP-20. 
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Table B-3 Special Operations Group (cont.) 



FIELD 



OCTAL 
CODE 



MNEMONIC 



EFFECT (see NOTE 1) 



HOSTPNL 



11 



12 



13 



14 



15 



16 



17 



PNLLIT 



DBELIT 



DBHLIT 



DBLLIT 



SWDB 



SWDBE 



SWDBH 



SWDBL 



PNLBS— ELITES 



QB L — ►PNLBS — +LITES 



DB MH -» PNLBS — ► LITES 



DB ML -»PNLBS— * LITES 



(SWR) — ►PNLBS— * OB 



(SWR)— ► PNLBS — * DB" and WRTEXP 
(see NOTE 2) 



(SWR)— ►PNLBS- 
(see NOTE 2) 



OB MH and WRTHMAN 



(SWR) — ►PNLBS' 
(see NOTE 2) 



DB ML and WRTLMAN 



NOTE 

1) MH = Mantissa High » Mantissa bits 00-11 
ML = Mantissa Low = Mantissa bits 12-27 
E » Exponent 

2) Restrict DPS, DPY and MI to: 



WRTEXP 
WRTHMAN 
WRTLMAN 



Write Exponent only 

Write High Mantissa only (bits 00-11) 

Write Low Mantissa only (bits 12-27) 
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Table B-3 Special Operations Group (cont.) 



FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT (see NOTE! 


SETPSA 


^ 


JMPA 


VALUE — ►PSA 




1 


OSRA 


(SRA) + 1— *SRA, (PSA) + 1— *SRS SRA , 
VALUE— ► PSA 




2 


JMP 


VALUE + (PSA)— * PSA 




3 


JSR 


(SRA) +■ 1— ►SRA, (PSA) + 1— »-SRS SRA , 
VALUE + (PSA)— ► PSA 




4 


JMPT 


(TMA)— »PSA 




5 


JSRT 


(SRA) + I — ►SRA, (PSA) + 1 — »SRS SRA , 
CTMA)— »PSA 




6 


JMPP 


(SWR)-*PNLBS — »PSA 




7 


JSRP 


(SRA) + 1— ►SRA, (PSA) + 1— *SRS SRA , 
(SWR) — ►PNLBS— »PSA 



NOTE 
VALUE = Sits 48-63 of this instruction (CB48-CB63) 
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Table B-3 Special Operations Group (cont.) 



: IELD 



OCTAL 
CODE 



MNEMONIC 



EFFECT (see NOTE 2) 



"SEVEN 



10 



11 



12 



13 



14 



15 



16 



17 



RPS0A 



RPS2A 



RPS0 



RPS2 



RPS0T 



RPS2T 



WPS0A 



WPS2A 



WPS0 



WPS2 



WPS0T 



WPS2T 



(PsI/aW ) — ► PNLBS "* L ITES 



(PS^ LUE )— ► PNLBS — ► LITES 



< PS VALUE + PSA> — PNLBS—LITES 



< ps value + psa)-* pnlbs -* lites 



(PS TMA ) — "PNl-BS-^LITES 



(PS$j A )— ►PNLBS— *LITES 



(SWR) — ► PNLBS — »PS. 



00 
VALUE 



(SWR) — ► PNLBS — »PS; 



02 
VALUE 



00 



(SWR)-* PNLBS —PS^ue+psa 



(SWR) — ►PNLBS — *PS; 



01 
VALUE+PSA 



(SWR) — ► PNLBS— ^PS: 



Q0 
'TMA 



(SWR) — M>NLBS — *-PS: 



Q2 
TMA 



NOTE 

1) This field requires 2 cycles to execute. 

2) VALUE = Bits 48-63 of this instruction (CB48-CB63) 

Q0 « Quarter zero of Program Source Word (PS00-PS15) 
Q2 ' Quarter two of Program Source Word (PS31-PS47) 



1043 



FPS 860-7259-003 



B - 



Table B-3 Special Operations Group (cont.) 



FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT (see NOTE 2) 


PSODD 


3 


RPS1A 


(PS VALUE ) "PNLBS— * LITES 


(see NOTE 1) 


1 


RPS3A 


iPS VALUE } * PNLBS ^ LITES 




2 


RPS1 


("CuE+PSA 5 — "PNLBS— ►LITES 




3 


RPS3 


< PS VALUE + PSA>— "PNLBS — LITES 




4 


RPS1T 


(PS TMA > t>PNLBS " LlJZS 




5 


RPS3T 


(PS^ A ) ►PNLBS ►LITES 




6 


- 


- 




7 


- 


- 




10 


WPS1A 


(SWR) ►PNLBS "PS?a LUE 




11 


WPS3A 


(SWR) ►PNLBS ►PS^lue 




12 


WPS1 


(SWR ) ►PNLBS — PS^ LUE+PSA 




13 


WPS3 


(SWR)— PNLBS — PS^ LUE+psA 




14 


WPS1T 


(SWR) »PNLBS ►PS^ 




15 


WPS3T 


(SWR) ►PNLBS "PS^ A 




16 


- 


- 




" 


- 


- 



NOTE 

1) This field requires 2 cycles to execute. 

2) VALUE « Bits 48-63 of this instruction (C848-CB63) 

Ql » Quartar one of Program Source Word (PS16-PS3I) 
Q3 = Quarter three of Program Source Word (PS48-PS63) 
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Table B-3 Special Operations Group (cont.) 



FIELD 



PS 
(see NOTE 1) 



OCTAL 
CODE 



11 



12 



13 



14 



15 



16 



17 



MNEMONIC 



RPSLA 



RPSFA 



RPSL 



RPSF 



RPSLT 



RPSFT 



RPSLP 



RPSFP 



LPSLA 



LPSRA 



LPSL 



LPSR 



LPSLT 



LPSRT 



LP.SLP 



LPSRP 



EFFECT (see NOTE 2) 



LVALUE'' 



►OB 



(ps value'" 



►OB 



(ps; 



LH 
VALUE+PSA 



) »OB 



(PS: 



FP 
VALUE-rPSA 



»OB 



(PS: 



LH x 
TMA J 



DB 



t PS ™>- 



OB 



("pNLBs'' 



DB 



^ PS PNLBS ) ' 



OB 



DB 



• ps; 



LH 
VALUE 



OB 



►ps; 



RH 

VALUE 



DB 



»PS, 



LH 



VAL'JE+PSA 



DB 



►PS 



RH 
VALUE+ PSA 



DB 



►PS, 



LH 
'TMA 



DB 



■PS 



RH 

TMA 



DB 



>PS 



LH 

PNLBS 



DB 



■PV 



RH 
PNLBS 



NOTE 

1) This field requires 2 cycles to execute. 

2) VALUE = Bits 48-63 of this instruction (CB48-CB63) 

LH = Left half of Program Source Word (Bits 30-31) 

RH = Right half of Program Source Word (Bits 32-63) 

FP = Program Source bits 26-63, used for floating-point literals 
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Table B-3 Special Operations Group (cont.) 



FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT (see NOTE) 


SETEXIT 





- 


- 




1 


SETEXA 


VALUE "SRS^ 




2 


- 


- 




3 


SETEX 


VALUE + (PSA) *SRS 5RA 




4 


- 


- 




5 


SETEXT 


TMA »SRS SRA 




6 


- 


- 




7 


SETEXP 


PSA + 1 »• SRS SRA 



NOTE 
Sets the current subroutine return address as indicated above. 
SRA does not change. 
VALUE = Bits 48-63 of this instruction. 
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Table B-4 Floating Adder Group 



1 14 




16 


17 




19 


20 




22 | 


! FADD 


Al 


A2 - | 








FADD1 









FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT 


FADD 





- 


See FADD1 field 




1 


FSUBR 


Subtract: (A2) - (Al) 




2 


FSU'B 


Subtract: (Al) - (A2) 




3 


FADD' 


Add: (Al) + (A2) 




4 


FEQV 


Logical Equivalence: (Al) XOT (A2) 




5 


FAND 


Logical and: (Al) AND (A2) 




6 


F0R : 


Logical or: (Al) OR (A2) 




7 


- 


See I/O Group 


Al 





NC 


(Al)— Ml 




1 


FM 


FM ► Al 




2 


OPX(IDX) 


^QPA+IDX 1 M1 Where XR * 1DX+4 




3 


DPY(IDX) 


' DPY DPA+1DX^ M1 Where YR = 1DX+4 




4 


TM 


(TM)— ► Al 




5 


ZERO 


0.0 ► Al 




6 


- 


- 




7 


- 


- 



NOTE 

All floating adder op-codes: 

1) Align exponents 

2) Perform the specified arithmetic, 
logical, or shift operation 

3) Normalize 

4) Convergently round 
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Table B-4 Floating Adder Group (cont.) 



FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT 


A2 





NC 


(A2) »A2 




1 


FA 


FA ► A2 




2 


DPX(IDX) 


(DPX DpA+1DX ) ^A2, Where XR = 1DX+4 




3 


DPY(IDX) 


(0PY DpA+1DX ) „A2, Where YR » 1DX+4 




4 


MD 


(MD) — * A2 




5 


ZERO 


0.0 ► A2 




6 


MDPX(IDX) 


SPFN+512 — ► A2 £ , (DPX DPA+1DX > — * A2 ' 




7 


EDPX(IDX) 


^ DPX DPA+1DX 1 ► A2 E . , SPFN — * A2 M (00-01 ) , 

— »A2 M (02-27) 


FADD1 





- 


No-op 




L 


FIX 


Convert (A2) to an Integer 




2 


FIXT 


Convert (A2) to an integer (result truncated) 




3 


FSCLT 


Shift (A2) right and increment A2 until 
A2 E = (SPFN+511) (result truncated). 




4 


FSM2C 


Convert (A2), from signed Magnitude to 2's 
complement. 




5 


F2CSM 


Convert (A2) from 2's complement to signed 
magnitude. 




6 


FSCALE 


Shift (A2) right and increment A2 E until 
A2 E « SPFN+511. 




7 


FABS 


Take the absolute value of (A2). 
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Table B-5 I/O Group 



| 14 16 | 17 19 


20 22 


1 1 1 I 1 I/O 


LDREG 




RDREG 




INOUT 




SENSE 




FLAG 




CONTROL 



FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT 


I/O 





- 


See LDREG field 




1 




See RDREG field 




2 


SPMDAV 


Spin until HD available 




3 


REX IT 


SRS (SRA)— " PNLBS 




4 


- 


See INOUT field 




5 


- 


See SENSE field 




6 


- 


See FUG field 




7 


- 


See CONTROL field 


LDREG 





- 


No-op 




1 


LDSPD 


OPBS ►SPD 




2 


LDMA 


DPBS »MA 




3 


LDTMA 


DPBS »-TMA 




A 


LDDPA 


DPBS »DPA 




5 


LDSP 


SP spo — SPFN, DPBS ►SPjpp 




6 


LOAPS 


DPBS ►APSTATUS 




7 


LDDA 


DPBS — ► DA 
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Table B-5 I/O Group (cont.) 



FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT 


*DREG 



1 
2 
3 
4 
5 
6 
7 


RPSA 
RSPD 

RMA 
RTMA 
RDPA 
RSPFN 
RAPS 

RDA 


(PSA) —►PNLBS 

(SPD) ► PNLBS 

(HA) ► PNLBS 

(TMA) — ► PNLBS 
(DPA) — ♦ PNLBS 

SPFN ► PNLBS 

(APSTATUS) ► PNLBS 

(DA) ► PNLBS 


INOUT 


13 
1 

2 
3 

4 
5 

6 
7 


OUT 
SPNOUT 

OUTDA 
SPOTDA 

IN 
SPININ 

I'NDA 
SPINDA 


DPBS — ► IODEVICE DA 

SPIN if IODRDYq. * 
DPBS — *IODEVICE QA 

DPBS — MODEVICE DA , SPFN — ►DA 

SPIN if IODRDY = 0, SPFN ► OA 

DPBS — ► IODEVICE QA 

(IODEVICE OA ) — •> INBS 

SPIN if IODRDY DA =■ 
(IODEVICE DA ) ►INBS 

(IODEVICE QA ) — * INBS, SPFN — ► DA 

SPIN if IODRDY QA « 0, SPFN — ► DA 
(IODEVICE DA ) — ►INBS 
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Table B-5 I/O Group (cont.) 



FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT (see NOTE) 


SENSE 





SNSA 


A QA ►IODRDY Flag 




1 


SPINA 


A Q „ — *■ IODRDY , SPIN if IODRDY = 




2 


SNSAOA 


A QA ► IODRDY, SPFN »DA 




3 


SPNAOA 


A DA ►IODRDY, SPIN if IODRDY = 0, 

SPFN ► DA 




4 


SNSB 


B 0A HORDY Flag 




5 


SPINB 


B QA ►IODRDY, SPIN if IODRDY = 




6 


SNSBDA 


3 Dft ►IODRDY, SPFN — ►DA 




7 


SPNBDA 


B DA — ►IODRDY, SPIN if IODRDY = 0, 
SPIN — ►DA 


FLAG 





SFL0 


1 — FLAG 




1 


SFL1 


1 ►FLAGj 




2 


SFL2 


1 — ► FLAG 2 




3 


SFL3 


1 — ► FLAG 3 




4 


CFL0 


— +FLAG 




5 


CFL1 


— ►FLAGj 




6 


CFL2 


— ►FLAGj 




7 


CFL3 


— ► FLAG 3 



NOTE 



A and B are I/O device dependent conditions, either 1 or 0. 
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Table B-5 I/O Group (cont.) 



FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT 


CONTROL 





HALT 


Halt 




1 


I0RST 


I/O reset 




2 


INTEN 


If CTL05 is set see Programmer ' s 
Reference Manual Part II, naqe E9. 




3 


INTA 


Interrupt acknowledge. Device Address of 
interrupting device put onto OPBS. 




4 


REFR 


Memory refresh sync 




5 


WRTEX 


Restricts DPX, DPY 4 MI to Write 
exponent only 




6 


WRTMAN 


Restricts DPX, DPY 4 MI to Write Mantissa 
only (Bits 0-27) 
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Table B-6 Branch Group 



23 




26 


27 




31 


COND 


DISP 



-IELD 



OCTAL 
CODE 



MNEMONIC 



EFFECT (see NOTE 2) 



COND 



11 



12 



13 



14 



15 



16 



17 



3R 



BINTRQ 



BION 



8I0Z 



BFPE 



RETURN 
(see NOTE 1) 



BFEQ 



BFNE 



BFGE 



BFGT 



BEQ 



BNE 



BGE 



BGT 



No-od 



Inhibit load of SPFN — ► SP 



SPO 



Branch always 



Branch if INTRQ (Interrupt Request 
flag = 1) 



Branch if IODRDY OA flag » 1 



Branch if IODRDY Dft flag = 



Branch on floating-point arithmetic 
error (overflow, underflow, or divide 
by zero). 



(SRS SRA ) — * PSA, (SRA) - 1 
(subroutine return jump). 



'SRA 



Branch if FA = 0-0 



Branch if FA i 0.0 



Branch if FA > 0.0 



Branch if FA > 0.0 



Branch if SPFN = 



Branch if SPFN t 



Branch if SPFN > 



Branch if SPFN > 



DISP 
(see NOTE 3' 



to 37 



If branch condition is true, 
(PSA) + DISP - 20 — ► PSA 



NOTE 

1) "RETURNS" may not be made in two successive instructions. 

2) FA and 5PFN are tested as to their state for the previous 
instruction. 

3) Thus the effective Branch Range is -20 to +17 relative to 
the current instruction. 
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Table B-7 Data Pad Group 



32 33 


34 35 


36 38 


39 




41 


42 




44 


45 




47 


48 




50 1 


DPX 


3PV 


OPBS 




XR 




YR 


XW 


YW | 



FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT 


OPX 


a 


- 


No-op 


(see NOTE 1) 


1 


OPX(1DX)<OB 


DPBS * dpx DPA+1 dx, Where XW = 1DX+4 




2 


DPX(1DX)<FA 


FA ► DPX DPA+1DX, Where XW » 1DX+4 




3 


DPX(1DX)<FM 


FM * DPX DPX+1DX, Where XW = 1DX+4 


DPY 
(see NOTE 1) 





- 


No-op 


1 


DPY(1DX}<0B 


OPBS— * 0PY DPA+IDX , Where YW = 1DX+4 




2 


DPY(1DX)<FA 


FA » OPY 

ur DPA+1DX, Where YW = 1DX+4 




3 


DPY(1DX)<FM 


FM ► 0PY DPA+1DX, Where YW = 1DX+4 


DPBS 





D8-ZER0 


0.0 ► OB 




1 


08- INBS 


INBS ► 08 




2 


DB'VALUE 


VALUE — ► 0B E , VALUE — * 38 ML , 
sign extended into DB m 




3 


DB=0PXC1DX) 


(DPX 0pA+10x j ►OB , Where XR - 1DX+4 




4 


DB=0PY(1DX) 


(DPY DPA+10X) * DB ' Where YR = 1DX+4 




5 


0B=MD 


(MO) ► OB 




6 


DB=SPFN 


SPFN + 512 — ► DB E , SPFN — ► JB '' ,L , 
sign extended into OB M( , 




7 


0B=TM 


(TM) — fcOB 



NOTE 

1) All bits written unless WRTEXP, WRTHMAN or WRTLMAN set. 
See S0P1 and HOSTPNL field. 

2) DPBS forced to if HOSTPNL field = 10 to 13 
ML = Mantissa Low (Mantissa Bits 12-27) 

MH = Mantissa High (Mantissa Bits 00-11) 
E = Exponent 

VALUE is a 16-bit 2's complement number, contained in 
bits 48-63 of the instruction word. 
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Table B-7 Data Pad Group (cont.) 



FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT 


XR 


to 7 




DPX Read EFA is (DPA) + XR - 4 


YR 


to 7 




DPY Read EFA is (DPA) + YR - 4 


XW 


to 7 




DPX Write EFA is (DPA) + XW - 4 


YW 


to 7 




DPY Write EFA 1s (DPA) + YW - 4, 
YW=XW if VALUE is used in another 
field. 
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Table B-8 Floating Multiplier Group 



51 


52 




53 


1 54 




55 




FM 




Ml 




1 


M2 







FIELD 


OCTAL 
CODE 


MNEMONIC 


EFFECT 


FM 





- 


No-op 




1 


FMUL 


Multiply: (Ml)*(M2l 


Ml 





FM 


FM — ► Ml 




1 


OPX(IDX) 


t0PX DPA+lDX ) ~* M1 ' Where XR=1DXM 




2 


OPY(IDX) 


(DPY opA+1QX ) — Ml, Where YR-10X+4 




3 


TM 


(TM) >M1 


M2 





FA 


FA ► M2 




1 


DPX(IDX) 


(0PX 0PA+1DX ) *' M2 ' Where XR * 1DX+4 




2 


OPY(IDX) 


(DPY DPA+1DX 1_ * M2 ' Where YR=1DXM 




3 


MD 


(MD) — ► M2 



NOTE 

These fields are not in effect if VALUE is used in another field. 
Arguments that are unnormalized by more than one position will 
produce incorrect results. 
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Table B-9 Memory Group 



56 



MI 



57 



5S 



59 



MA 



60 



61 



DPA 



62 



63 



TMA 



FIELD 
(see NOTE 1) 


OCTAL 
CODE 


MNEMONIC 


EFFECT (see NOTE 3) 


MI 


9 


- 


No-op 




1 


MI<FA 


FA — ►MI, Write MI into Data Memory 
(see NOTE 2) 




2 


MI<FM 


FM — ►MI, Write MI into Data Memory 
(see NOTE 2) 




3 


MI<DB 


DB — ► MI, Write MI into Data Memory 
(see NOTE 2) 


MA 





- 


No-op 




1 


INCMA 


(MA)+1 — ►MA, initiate a Data Memory 
cycle 




2 


DECMA 


(MA)-l — ►MA, initiate a Data Memory 
cycle 




3 


SETMA 


SPFN — ►MA, initiate a Data Memory 
cycle 


DPA 





- 


No-op 




1 


INCDPA 


(DPA)+1 — *DPA 




2 


DECDPA 


(DPA)-l — ► DPA 




3 


SETDPA 


SPFN ► DPA 


TMA 





- 


No-op 




1 


INCTMA 


(TMA)+1 — ► TMA, initiate a read from 
Table Memory 




2 


DECTMA 


(TMA)+1 ►TMA, initiate a read from 

Table Memory 




3 


SETTMA 


SPFN ► TMA, initiate a read from 

Table Memory 



NOTE 

1) These fields are not in effect if a value is used by another field. Changes made in MA, TMA, 
or DPA do not affect the values of these registers used by other fields during the current 
instruction. 

2} All bits written unless WRTEXP, WRTHMAN or WRTLMAN is set. See S0P1 and HOSTPNL fields. 

3) DB 1s used in place of SPFN if LDREG field is used. 
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Notice to the Reader 



• Help us improve the quality and usefulness 
of this manual. 

• Your comments and answers to the following 
READERS COMMENT form would be appreciated. 



To mail: fold the form in three parts so 
that Floating Point Systems' 
mailing address is visible; seal. 

Thank you 



READERS COMMENT FORM 



Document Title 



Your comments and answers will help 
us improve the quality and usefulness 
of our publications. If your answers 
require qualification or additional 
explanation, please comment in the 
space provided below. 



How did you use this manual? 

AS AN INTRODUCTION TO THE SUBJECT 

AS AN AID FOR ADVANCED TRAINING 

TO LEARN OF OPERATING PROCEDURES 

TO INSTRUCT A CLASS 

AS A STUDENT IN A CLASS 

AS A REFERENCE MANUAL 

OTHER 



Did you find this material , . . 



YES 



NO 



• 


USEFUL? ( ) ( ) 


• 


COMPLETE? ( ) ( ) 


• 


ACCURATE? ( ) ( ) 


t 


WELL ORGANIZED? ( ) ( ) 


• 


WELL ILLUSTRATED? ( ) ( ) 


• 


WELL INDEXED? ( ) ( ) 


•• 


EASY TO READ? ( ) ( ) 


• 


EASY TO UNDERSTAND? ( ) ( ) 



Please indicate below whether your 
comment pertains to an addition, 
deletion, change or error; and, where 
applicable, please refer to specific 
page numbers. 



Page 



Description of error or deficiency 



From: 



Name 

Firm 

Address _ 
Telephone 



Title 

Department _ 
City, State 
Date 
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